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To Laura and Julie 


Symmetry has accompanied our attempts to understand the inner workings of 
nature ever since the birth of science. The powerful concept of spontaneous 
symmetry breaking (SSB) makes it possible to reconcile the complex experimental 
phenomenology with the simplicity of underlying natural laws. It is therefore a 
necessary ingredient of virtually any textbook on quantum field theory. However, 
a typical textbook-level treatment offers a mere simplified version of the state of 
the art of the subject in its early days of the 1960s. This hardly does justice to the 
richness and beauty of SSB. Moreover, the subject has undergone development in 
the last two decades that has substantially sharpened our understanding of SSB in 
quantum many-body systems. 

This book is an attempt to fill the gap in the literature and to provide an up-to- 
date survey of the landscape of SSB from the perspective of effective field theory 
(EFT). In writing the book, I was aiming at learners of all seniority levels who are 
familiar with the basic notions of quantum field theory. The style of the text and 
the mathematical background required will probably be a best match for graduate 
students of theoretical physics. However, the introductory parts of the book might be 
accessible even to advanced undergraduates. Those will benefit from studying first 
Part I of the book, which introduces the concepts of SSB and the EFT description 
thereof at an elementary level. More experienced readers might proceed directly to 
Part II. This offers an overview of the physics of SSB at a standard textbook level. 
However, it already contains numerous details that are hard to find in a student- 
friendly form elsewhere. The core techniques of EFT for SSB are developed in 
Parts III and IV. These provide a comprehensive account of the subject, from the 
pioneering works of the 1960s to some recent developments that are otherwise only 
found scattered in the research literature. 

The most important recent results included are, as of writing this book, about 
a decade old. This applies in particular to the completed classification of Nambu-— 
Goldstone (NG) bosons (Chap. 6), the EFT formalism for spontaneously broken 
internal symmetries in quantum many-body systems (Chap. 8), and some applica- 
tions of EFT to systems with spontaneously broken spacetime symmetry (Chaps. 13 
and 14). However, some of the material included also reflects new works that 
appeared while I was already writing the book. This is the case especially for the 
discussion of scattering of NG bosons in Chap. 10. Finally, Chap. 12 puts forward a 
novel formalism for implementation of spontaneously broken spacetime symmetry 


vii 


viii Preface 


that, to the best of my knowledge, has not appeared in this form before. This 
significantly influences the narrative surrounding the EFT for broken spacetime 
symmetries in Chaps. 13 and 14. My hope is that the change is for the better. 

The one thing I have learned while writing this book is that one’s understanding 
of any topic is a never-ending process of evolution. I would therefore like to 
extend my gratitude to those who helped me along the way. First and foremost, I 
am indebted to Angelo Esposito, Yoshimasa Hidaka, and Jiří Novotny, who have 
read critically parts of the manuscript and given me constructive feedback, helping 
me improve the text on multiple fronts. My personal view of the subject of this 
book and of effective field theory in general has been shaped by interactions with 
numerous mentors and collaborators throughout the years, including Jens O. Ander- 
sen, Christoph P. Hofmann, Jiří Hošek, Martin Mojahed, Sergej Moroz, Hitoshi 
Murayama, Jiří Novotny, Riccardo Penco, Aleksi Vuorinen, Haruki Watanabe, and 
Naoki Yamamoto. Last but not least, I would like to thank Stefan Theisen for 
encouraging me to write this book. 


Stavanger, Norway Tomas Brauner 
September 2023 
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Notation and Conventions 


List of Acronyms 


Acronyms that appear in the book and their use in established collocations: 


ChPT Chiral perturbation theory 

CS Chern—Simons (Lagrangian, theory) 
DBI Dirac—Born-Infeld (Lagrangian, theory) 
DM Dzyaloshinskii—Moriya (interaction, term) 
EFT Effective field theory 

EM Energy—momentum (tensor) 

EoM Equation of motion 

GW Goldstone—Wilczek (current) 

THC Inverse Higgs constraint 

LC Levi-Civita (connection, symbol, tensor) 
LO Leading order 

MC Maurer—Cartan (equation, form) 

NG Nambu-Goldstone (boson, field, mode) 
NLO  Next-to-leading order 

QCD Quantum chromodynamics 

SSB Spontaneous symmetry breaking 

VEV Vacuum expectation value 

VPD ~~ Volume-preserving diffeomorphism 

WZ Wess—Zumino (action, term) 


For the reader’s convenience, each of these acronyms is also defined in the text the 
first time it appears in a particular chapter. 


Mathematical Conventions 


This book aims at audiences from diverse areas of physics, which makes it virtually 
impossible to maintain coherent notation without alienating some of the readers. I 
therefore try to adopt conventions that are as close as possible to practice, at the cost 
of being somewhat dependent on the context. 


XV 


xvi Notation and Conventions 


Thus, a “relativistic” notation is used to discuss physical systems whose dynam- 
ics is Lorentz-invariant. I assume a flat Minkowski spacetime unless (exceptionally) 
stated otherwise. Much of the content of the book is valid for spacetimes of any 
dimension D > 3; see Sect. 15.2 for a justification of this constraint. The timelike 
Minkowski metric, 


guv = diag(+1, —1, —1,...) , 


is implicitly assumed when needed. Spacetime vectors (or, in D = 4 dimensions, 
four-vectors) are denoted using italics: p,q,.... An inner product of spacetime 
vectors, implied by the Minkowski metric, is indicated with a dot, 


Pq = 8uvP"q” = pÄqp = ppg" . 


Here the Einstein summation convention is applied to any pair of repeated indices, 
one covariant and one contravariant. The Levi-Civita tensor €,,,... in Minkowski 
spacetime is normalized so that ¢°!*"" = 1. Finally, I consistently use the “natural 
units” of high-energy physics where both the speed of light c and the reduced Planck 
constant fi are set to one. Thus, the energy and momentum of a relativistic particle 
are identified respectively with the frequency and wave vector of the corresponding 
quantum-mechanical plane wave. Moreover, the Minkowski square of the energy— 
momentum of the particle is p? = p- p = m?, m being the rest mass of the particle. 

On the contrary, in systems lacking manifest Lorentz invariance, typical for 
applications to condensed-matter physics, I adopt a “nonrelativistic” notation. Here, 
spatial vectors are denoted using boldface italics: a, b, .... The Einstein summation 
convention is still used, but indices are no longer raised or lowered with the 
Minkowski metric. Instead, upper and lower indices are used interchangeably just 
for visual clarity and are treated as equivalent. Formally, this amounts to using the 
flat Euclidean metric, g;;5 = 6,5. The dot product of two spatial vectors can then be 
written in a number of equivalent forms such as 


a-b=5,,a'b* = a,b, =a'b’ =a'b, = a,b" . 


The symbol V is used for the gradient operator, that is, a spatial-vector differential 
operator with components 0, = 0/0x’. Wherever needed explicitly, the number of 
spatial dimensions is denoted as d = D — 1. 


Part I 


Prologue 


2 


Check for 
updates | 


1.1 What Is Effective Field Theory? 


The laws of nature have a hierarchical structure, stratified by the resolution scale 
of our observations. When Isaac Newton published in 1687 his gravitational law 
explaining Kepler’s laws of planetary motion, he did not need to know the details of 
the planets’ inner structure. His theory gives an accurate description of the dynamics 
of the solar system while treating the planets as point-like objects. 

The phenomenological understanding of the basic material properties of solids, 
liquids and gases culminated in the mid-ninetenth century through the work of 
Clausius, Gibbs, Kelvin, Maxwell and others. Much about these different phases 
of matter and the transitions between them can be, and was, learned without insight 
into the molecular structure of matter. It was only about a half-century later that 
quantum mechanics provided an adequate framework for the description of atoms 
and molecules. 

Quantum mechanics itself is extremely successful in explaining the chemical 
properties of different substances and mechanical properties of solids, liquids and 
gases. In fact, virtually all natural phenomena observed at macroscopic scales boil 
down either to gravity or to Maxwell ’s electromagnetism combined with quantum 
mechanics. Up to rare exceptions, for instance radioactivity, one does not need to 
understand the structure of atomic nuclei. Nuclear physics itself only advanced later, 
in the 1930s. And again, much about nuclear structure and reactions was understood 
before the discovery in the 1960s that individual nucleons are composed of yet 
smaller particles, the quarks. 

The smallest currently known constituents of matter are quarks, bound in atomic 
nuclei, and leptons, of which the electron is but one example. How do we know 
that the chain of successive divisions into smaller and smaller constituents ends 
here? We do not. Perhaps, one day, even smaller building blocks of nature will be 
discovered. Nevertheless, we do have a successful theory that is able to account for 
experimental observations at the current resolution frontier: the Standard Model of 
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particle physics. This theory does not depend on the details of as yet undiscovered 
microscopic physics. Our ignorance of such microscopic structure currently out of 
reach is subsumed into the values of the input parameters of the Standard Model. 

This is the essence of effective field theory (EFT). Going back to where 
we started, the information about the detailed structure of planets, unknown to 
Newton, could be subsumed into the values of the planets’ masses. Likewise, 
the quantum-mechanical nature of molecular interactions only affects macroscopic 
thermodynamics through material constants such as specific heats or elastic moduli. 
All that quantum mechanics in turn needs to know about atomic nuclei are their 
masses and electric charges. Finally, the quark structure of nucleons only manifests 
itself through binding forces between nucleons that hold the nuclei together. These 
nuclear forces were studied thoroughly already in the mid-1930s by Fermi, Yukawa 
and others, three decades before the existence of quarks was conceived by Gell- 
Mann and Zweig. 

Generally speaking, EFT is a framework for a quantitative description of nature 
at certain level of resolution. Whatever physics might exist at shorter, unresolved 
length scales can be taken into account through the values of the parameters of 
the EFT. In the absence of further information about the microscopic physics, 
these parameters have to be determined by experiment. This line of reasoning 
has implicitly accompanied the evolution of physics from its early stages. In the 
second half of the twentieth century, it was however developed into a full-fledged 
quantitative formalism with tremendous impact, from concrete physical applications 
to the philosophy of science. One of the founders of modern EFT, Steven Weinberg, 
offers a nice account of the early history of EFT in [1]. The array of successful 
applications of the EFT program has grown so long in the last decades that I cannot 
even list them all here. Instead, I point the reader to the literature. References 
to introductory-level expositions of EFT with applications are given below in 
Sect. 1.3.1. 


1.2 Broken Symmetry Zoo 


Most macroscopic phenomena observed in nature are dominated by the collective 
dynamics of the fundamental constituents of matter. In hindsight, this is the 
reason why nineteenth-century physics was so successful in describing nature. 
Consider, for instance, a single-component fluid in thermodynamic equilibrium. 
All the thermodynamic properties of the fluid are determined by the values of two 
independent observables such as temperature and pressure. If we now disturb the 
fluid locally, the interactions among the fluid molecules will cause the perturbation 
to propagate. This propagation manifests macroscopically as a sound wave, possibly 
accompanied by a background flow. Such small perturbations from thermodynamic 
equilibrium are described by hydrodynamics, which is an early example of an EFT 
of the modern type. The microscopic input required here is the equation of state of 
the fluid along with a set of transport coefficients. Other than that, hydrodynamics 
is based only on a set of local conservation laws including energy, momentum and 
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Fig. 1.1 Schematic picture of the ordered state of a ferromagnet below the Curie temperature Tc 
and of the disordered state above the Curie temperature Tc 


whatever other conserved charge the fluid might carry. We know from Noether’s 
theorem that local conservation laws are a consequence of continuous symmetries. 
One can therefore say that the dynamics of fluids near equilibrium is largely dictated 
by symmetry. That is typical of spontaneous symmetry breaking (SSB). 

Let me illustrate this on another example. Ferromagnets are materials which, 
below certain temperature called the Curie temperature, exhibit spontaneous align- 
ment of magnetic moments of their atoms. A solid ferromagnet can be pictured as a 
lattice, each of whose nodes carries a single spin degree of freedom, see Fig. 1.1. The 
mutual interactions between the spins may be perfectly isotropic, that is invariant 
under spatial rotations. Yet, the ferromagnetic state below the Curie temperature 
obviously possesses a preferred direction of alignment of the spins. This direction 
is a priori arbitrary and may be set for instance by boundary conditions, not by the 
internal dynamics of the ferromagnet itself. The emergence of an equilibrium state 
that breaks the intrinsic symmetry of the system is a hallmark of SSB. 

A local perturbation of the ferromagnetic state will be propagated by spin— 
spin interactions and manifest itself macroscopically as a spin wave. The existence 
of wave-like excitations which dominate the physics at long distances is another 
general feature of SSB; these are called Nambu—Goldstone (NG) bosons. Just like 
for sound waves in fluids, the dynamics of ferromagnetic spin waves is controlled 
by symmetry. A quantitatively accurate EFT of spin waves can be constructed solely 
based on the underlying spacetime translation invariance, spatial rotation invariance, 
and the fundamental commutation relations of angular momentum (spin). 

The above two examples hint at a particular type of EFT that governs the long- 
distance physics of systems with SSB. The relevant degrees of freedom of this EFT 
are the NG bosons whereas the details of the EFT are dictated by symmetry. It 
should therefore not come as a surprise that many of the techniques developed in this 
book rely on the theory of Lie groups, and to a lesser extent on differential geometry. 
The precise mathematical structure of the EFT depends on the kind of symmetry in 
question. It is common to use the term internal symmetry for symmetries that act 
directly on the dynamical degrees of freedom of a given physical system. SSB of an 
internal symmetry lies for instance behind the phenomena of (anti)ferromagnetism 
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or superfluidity. On the other hand, a spacetime symmetry is a geometric property 
of space and time. It affects the dynamical degrees of freedom indirectly as fields 
living in the physical spacetime. SSB of spacetime symmetries is more subtle 
yet ubiquitous and relevant for all phases of matter. This is the reason why the 
form of hydrodynamics is largely fixed by the local conservation laws of energy 
and momentum. A different pattern of SSB gives rise to the elasticity theory of 
solids. There are also examples of phenomena that exhibit a combination of SSB 
of internal and spacetime symmetry. These include for instance vortex lattices in 
rotating superfluids or type-II superconductors, anisotropic Cooper pairing in p- 
wave superconductors, or the helical order in chiral magnets. 


1.3 Structure of This Book 


The subject of this book is of central importance to quantum field and many-body 
theory. As such, it connects several branches of physics including, but not limited to, 
high-energy physics, condensed-matter physics, astrophysics and cosmology. With 
the explicit aim to cater to these different communities, I have tried my best to make 
the text relevant and comprehensible to audiences with different backgrounds. It is 
up to the reader to judge to what extent this effort has been successful. 


The book aims primarily at graduate students of theoretical physics regardless 
of their concrete specialization. The text should in principle be accessible to 
anyone who has taken a first course on quantum field theory. With this target 
group in mind, I have interspersed the text with two didactic elements. On the 
one hand, there are numbered examples, graphically separated from the main 
text, that mostly serve to illustrate newly introduced theoretical concepts or 
arguments. Occasional more advanced examples mention interesting applica- 
tions that would disrupt the line of discussion if included in the text. On the 
other hand, gray blocks such as this one point to important facts or subtleties 
that might otherwise be missed. They are meant to encourage the reader to 
take a critical look at the presented material. 


The mathematics background required to benefit from this book likewise cor- 
responds to that of a typical student of theoretical physics. It consists mostly 
of linear algebra, advanced calculus including complex calculus and the calculus 
of variations, and the theory of partial differential equations. The only piece of 
background I assume that might go beyond the curriculum of some graduate 
programs is certain familiarity with group theory and its applications to physics. 
The level and extent as covered by Georgi [2] is fully sufficient; Chaps. 14 and 15 
of [3] are a good start that will get the reader very far. Some more advanced parts 
of the book rely on basic background in differential geometry. Everything that is 
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needed in this regard (and presumably more) is covered in a self-contained manner 
in Appendix A. 

In order to make the book potentially useful also for more experienced 
researchers, the content is composed of several layers. In the following Chaps. 2 
and 3, I introduce the concepts of SSB, NG bosons and the EFT description thereof 
through the case study of a simple toy model. This is intended for an uninitiated 
reader. Anybody else can proceed directly to Part II. Chapters 5 and 6 of this part 
give a thorough introduction to the physics of SSB. This serves as a supplement to 
advanced courses on field theory and may thus be suitable for graduate students or 
junior researchers. An expert reader can skip this part as well. 

Parts III and IV constitute the core of this book that will bring the reader 
close to the research frontier. Part III is dedicated to spontaneously broken internal 
symmetries. Here the EFT methodology is by now well-settled, and is developed 
in detail in Chaps. 7 and 8. Chapters 9 and 10 work out some concrete applications 
of the general formalism. In contrast to internal symmetries, spontaneously broken 
spacetime symmetries remain a rather active area of research with a number 
of open questions left to answer. The choice and organization of material in 
Part IV is therefore necessarily more subjective. I however try to draw on the 
analogy with broken internal symmetries as much as possible. This largely fixes 
the content of Chap. 12, which details the necessary modifications to the techniques 
of Chap. 7. The following Chaps. 13 and 14 then develop the EFT for spontaneously 
broken spacetime symmetry using numerous examples, roughly sorted by increasing 
complexity. 

In spite of the rather large extent of the book, I was forced to make compromises 
regarding the choice of material. Sadly, some very exciting aspects of SSB had to 
be left out altogether. Covering them at the same level of detail as in the rest of the 
book would have required a substantial amount of additional background. Some of 
these topics are mentioned at least briefly in Part V. 

Let me conclude this introduction with a remark on the chosen style of the book. 
In writing the text, I have not tried to give a review of all existing research literature 
on the subject. My intention was instead to create a unified narrative that would give 
the reader the tools necessary to critically assess existing results and approaches, 
and to launch a research program of their own. In line with this philosophy, I have 
deliberately adopted a very restrictive policy regarding bibliography. The purpose 
of references in this book is primarily to aid the reader. Generally, I thus only 
include references to point the reader to specified supplementary information, or 
occasionally to fill in a gap in the argumentation. Only when I explicitly borrow an 
idea or example from a single identifiable source, do I credit this with a citation. I 
therefore apologize to all those who might be missing their name in the reference 
list. 
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1.3.1 Further Reading 


This book revolves around the application of EFT to physical systems with SSB. 
Most of the graduate-level physics background that is needed to follow the advanced 
parts of the book is covered in Part II. This however does not mean that the text is 
comprehensive. Here is a (biased) list of suggestions for readers who wish to learn 
more details than what this book can cover. 

A monograph fully devoted to EFT was missing on the market for years. This gap 
has now been filled by the new opus by Burgess [4]. A reader seeking a more concise 
introduction to EFT will find a variety of excellent resources with slightly different 
balance of topics included. Thus, for instance, [5] and [6] cover a spectrum of 
applications to particle phenomenology and nuclear physics. Reference [7] includes 
a discussion of physics beyond the Standard Model and of the application of EFT to 
fluid dynamics, which is hard to find in a student-friendly form elsewhere. Cohen’s 
lectures [8] are a comprehensive source on EFT with a very useful list of further 
references including annotations. There are also great texts that mostly focus on 
EFT for NG bosons. This is true especially of [9, 10] and to some extent of [11]. 

SSB is rarely a subject of a dedicated monograph. An exception is the classic 
by Strocchi [12], which will satisfy a mathematically oriented reader desiring a 
rigorous treatment of SSB. The comprehensive early review [13] remains a valuable 
source of insight, including details that seem to have been forgotten by practitioners. 
The lecture notes [14] give a modern introduction to SSB that covers recent 
developments. They omit a discussion of EFT for NG bosons though, and are thus 
nicely complemented by [15] which focuses largely on the latter. Finally, the specific 
topic of classification of NG bosons is addressed in detail by [16]. 
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The purpose of this chapter is to introduce the uninitiated reader to the key concepts 
underlying the book. I assume that the reader has taken a first course on quantum 
field theory, but has not necessarily heard about spontaneous symmetry breaking 
(SSB). The field theory tools we are going to need include identification of a 
continuous symmetry and the corresponding conserved current via Noether’s theo- 
rem, extraction of interaction vertices from the Lagrangian and their representation 
by Feynman diagrams, and their use in a perturbative calculation of scattering 
amplitudes. 

I will start straight away by introducing a simple toy model for a complex scalar 
field @ and a Dirac field Y, defined by the Lagrangian density 


L = d p* 3 p + m*o — A*S + (€*b + eg“) 


(2.1) 
+ Vid — g(UL Vo + YrRYLG*) . 

Here m, à and g are parameters that are assumed to be real and positive but otherwise 
arbitrary. The parameter € is complex and should be sufficiently small in a sense 
made more precise below. Finally, the usual slash notation indicates contraction of 
a Lorentz vector with Dirac y-matrices, thus g = y“d,. I am not going to explain 
the motivation for the choice of this particular model. Let us rather take this as an 
invitation to explore with open mind its interesting physical properties. 


I have chosen the relativistic notation just to make the mathematical anal- 
ysis of the model as simple as possible. It is not essential for qualitative 
understanding of the results; much of the calculation could be repeated with 
nonrelativistic (Schrödinger) fields. This would of course require some minor 
modifications of the Lagrangian (2.1). First, the kinetic terms for @ and Y 


(continued) 
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would have to be changed. Second, the chiral components Wy p of the Dirac 
field YW would have to be replaced with two species of nonrelativistic spin-1/2 
fermions. 


Apart from Poincaré symmetry, guaranteed by the manifestly relativistic setup, 
the model (2.1) has two natural symmetries, 


>o, W — exp(iey) VY (exact) , 
(2.2) 
ob —> exp(—2iea)d , W — exp(ieays) VY (approximate) . 


In the context of relativistic theories with fermionic degrees of freedom, the trans- 
formations with the parameters ey, a are known respectively as vector and axial. 
The axial transformation only becomes a true symmetry of the Lagrangian (2.1) 
in the limit € — 0. For small nonzero e€, it is therefore sensible to think of it as 
an approximate symmetry. By means of Noether’s theorem, the vector and axial 
symmetries imply the existence of the following currents and conservation laws, 


Jt = Wy", JK = Wy" ysv — 2i(g* a" — a" G*) , aa 
att =0, au Ih = ile" — ep") . . 


The expression for 0,, J N confirms the observation that the axial symmetry becomes 
exact in the limit € —> 0. 

In the rest of this chapter, we will analyze the physical properties of our toy 
model. To that end, we will utilize the basic workflow of perturbative quantum field 
theory, rooted in the theory of oscillations of mechanical systems [1]: 


e Find the ground state (Sect. 2.1). This is the mandatory first step for any quantum 
system and is carried out by minimizing the (classical) Hamiltonian of the model. 

e Identify the spectrum of excitations above the ground state (Sect. 2.2). This is 
based on the part of the Lagrangian, bilinear in the fluctuations of the fields ¢, Y 
around the ground state. 

e Work out the physical consequences of interactions of the excitations (Sect. 2.2). 
This follows from the part of the Lagrangian of higher orders in the fluctuations. 


Finally, in Sect. 2.3 we will see how the most distinguishing features of the model 
can be captured by a low-energy effective field theory (EFT). 
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2.1 Spontaneous Symmetry Breaking 


As the first step, we would like to find the (classical) ground state of the model (2.1). 
To that end, we need the classical Hamiltonian density, 


H = dnb" dop + VO" -Vo + VO, o*) — Viy - VY + g(ULVRd + VRYLO") , 
(2.4) 


where the scalar potential V(¢, $*) is given by 
V(b, $*) = -m*o + AG")? — (“p + EG") . (2.5) 


In the ground state of any quantum system, the vacuum expectation value (VEV) 
of a fermionic field must be zero. We can therefore focus on the scalar sector of 
the Hamiltonian (2.4). The part thereof containing derivatives of ¢, * is positive- 
semidefinite. Hence, the lowest-energy state will be such that the VEV of @, (@), 
is a coordinate-independent constant. The value of this constant is determined by 
minimizing the potential V(¢, $*). It is rather obvious that the ground state is 
nontrivial in that (@) + 0. First, for any nonzero e, the first partial derivatives of 
V (p, o*) at @ = ġ* = 0 are nonzero as well. Second, even in the limit € > 0, 
the Hessian matrix of V(¢, *) at p = ¢* = 0 is negative-definite. In the jargon 
of high-energy physics, this is a consequence of the “mass term” m7o*@ in (2.1) 
having a “wrong sign.” 

What is the ground state then? The VEV (@) must be a solution to the condition 
0 = aV/dd* = —m7o + 21¢(6*) — €. It must therefore have the same complex 
phase as €, possibly up to an overall minus sign. Let us introduce the notation 


€= —e” , (o) = —e” , (2.6) 


where @ is the common complex phase and v,w are real. The value of v 
characterizing the scalar VEV is then related to the parameter w by 


vv —m)=w. (2.7) 


Being a cubic equation, (2.7) has up to three real solutions for v. In the limit w — 0, 
these are v = 0, +m//2. Note that for € = 0, the potential V ($, 6*) only depends 
on ¢*¢@ and thus is insensitive to the phase 0. There is then a continuum of states of 
lowest energy, having v = +m/,/A and arbitrary 6. This can be traced to the fact 
that for € = 0, the axial transformation in (2.2), which changes the phase of @, is an 
exact symmetry of the Lagrangian (2.1). For any field configuration ¢ (x), the field 
el? h(x) with constant 0 therefore has the same energy as ¢ (x) itself. The existence 
of degenerate ground states is a hallmark of SSB. The defining property of SSB, in 
the context of our toy model, is that the VEV (¢) is not invariant under the axial 
transformation (2.2). The ground state has a lower symmetry than the Lagrangian. 
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The reader might be concerned that having multiple degenerate ground states 
could lead to subtleties. (It does.) In any case, this was not anticipated by our 
algorithmic workflow outlined below (2.3). A simple workaround is to keep nonzero 
€, which ensures the existence of a unique state of lowest energy. For positive w, 
this is the solution of (2.7) with the highest value of v. Using the closed formula for 
the solutions of a cubic equation would be impractical. It is however easy to find 
an approximate expression for v as long as w is small enough. One just needs to 
rewrite (2.7) in a form suitable for iteration, 


m 14 w m 14 Vw m 4 w 4 (2.8) 
VaV mu SA 2m? Va 2m? 


We end up with an infinite series with the expansion parameter VAw/m?. This gives 
us a precise mathematical condition for w being “small enough,” w « m?/V/i. 
The condition guarantees that the value of v in the ground state differs very little 
from m/v. In other words, € now acts merely as a perturbation that enables us 
to pin down a unique ground state, without modifying this state appreciably. In the 
following, I will implicitly assume that the condition w < m3/,/1 is satisfied. 


2.2 Nambu-Goldstone Boson and Its Interactions 


The next step is to identify the spectrum of excitations above our ground state, and 
to work out the consequences of their interactions. To that end, we need to find 
field variables with vanishing VEV that make the bilinear part of the Lagrangian 
diagonal. Such variables correspond to the normal modes, well-known from the 
classical theory of small oscillations [1]. Upon expansion in powers of the new 
fields, the bilinear part of the Lagrangian determines the spectrum of normal modes, 
and the higher-order parts their interactions. 

The change of variables to the normal modes can be thought of as a choice 
of parameterization of ¢@. (The Dirac field W already has a vanishing VEV, so 
we do not have to do anything about it.) There is no a priori preferred choice of 
parameterization. Below, I will illustrate two different options, one of which is 
intuitively natural, whereas the other is more physical and practically convenient. 


2.2.1 Linear Parameterization 


One obvious possibility how to parameterize ¢ is to shift it by its VEV, and represent 
the resulting complex field in terms of its real and imaginary parts. This is still not 
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completely unambiguous, but a smart choice is for instance! 


id 
pax) = ale +(x) tind). (2.9) 


Taking out an overall factor e? ensures that the phase 8 drops out of the Lagrangian. 
Inserting this parameterization into (2.1), we get a Lagrangian in terms of x, 7, Y 
that can be organized by the dependence on the scalar fields, 


L = Lac + Lilin + Git + Lv . (2.10) 


The piece independent of the fields, ac = (3/ A)avt — (1 / 2)m? v2, determines up 
to a minus sign the energy density of the ground state. There are no terms linear in 
X, T; this is ensured by their definition (2.9) through removing from @ its VEV. The 
bilinear part of the Lagrangian is of greatest interest to us, 


1 2_ 1 22,1 21 2 2 
Shilin = z ux) = 5”xX + z Our) Z gaT > (2.11) 
where the mass parameters are given by 
2 2 3w 2 w 
m, = 2m“ + — , m == (2.12) 
v v 
An obvious corollary is that in the limit of exact axial symmetry (€ = 0), the 


field m becomes massless. This is our first Nambu—Goldstone (NG) boson. It is 
easy to see the origin of the massless mode in the spectrum. With a canonically 
normalized kinetic term for @ in the Lagrangian, the mass spectrum is determined 
by the eigenvalues of the Hessian matrix of the potential V(@, ¢*) in the ground 
state. But I have already argued that for any $(x), changing its overall phase 
will give a configuration of the same energy. Hence for any @¢ # 0, there 
is a direction in the field space in which the potential does not change. This 
guarantees that the Hessian has an eigenvector with zero eigenvalue. The existence 
of the NG boson is therefore a direct consequence of the axial symmetry and its 
spontaneous breakdown, independent of the specific Lagrangian. This is the essence 
of Goldstone’s theorem [2,3]. 

Equation (2.12) likewise tells us that in the limit of exact axial symmetry, that is 
€ = 0, the mass of x approaches m4/2. The massive counterpart of the NG boson is 
usually referred to as the Higgs boson (or Higgs mode). The interaction part of the 


' The somewhat unusual notation is required for compatibility with the later discussion in Chap. 7. 
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scalar Lagrangian in (2.10) consists of cubic and quartic terms, 

Boe = Aux? +9) OË Hn. 2.13) 
Finally, the fermion part of the Lagrangian reads 


Ly = vid — my)b — vx + iysm)y , (2.14) 


where I have redefined the Dirac field by Y = exp[—(i/2)6y5] w in order to bring 
the kinetic term into the standard Dirac form. We can see that the originally massless 
fermion has acquired mass, my = gv/ /2. This is an example of generation 
of fermion mass by spontaneous breaking of chiral symmetry, which plays an 
important role in the Standard Model of particle physics. 


2.2.2 Scattering of Nambu-Goldstone Bosons 


Now that we know the mass spectrum of our model, let us see how the different 
particles interact with each other. We are going to need Feynman rules for the 
interaction vertices, encoded in the Lagrangians (2.13) and (2.14), 


ig _ 8Y5 
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The undirected solid and dashed lines represent respectively x and x. The oriented 
solid lines stand for the fermion y. These Feynman rules for the interaction 
vertices must be augmented with standard relativistic propagators for x, 2, y with 
respective masses my, Mz, My. 

I will work out a few sample scattering processes including the NG boson z. 
All of these processes will have the simple four-particle kinematics displayed in 
Fig. 2.1. In all cases, I will use the same notation p1, p2 for the energy-momentum 
of the incoming particles and p3, p4 for the energy-momentum of the outgoing 
particles. For the sake of brevity, I will also use the Lorentz-invariant Mandelstam 
variables s = (pı + p2}, t = (pı — p3) and u = (pı — p4)*. As a consequence of 
energy and momentum conservation, the Mandelstam variables satisfy the constraint 
stttu = mi+m5+m3+m3, where mı, m2, m3, m4 are the masses of the particles 
participating in the scattering process. 


(2.15) 
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SY 


Fig. 2.1 Kinematics of the four-particle scattering processes discussed in this chapter. The arrows 


indicate flow of energy-momentum. In other words, the initial state includes two particles with 
momenta pı, p2, whereas the final state two particles with momenta p3, p4 


Let us first inspect 7 — mz scattering. The invariant amplitude for this process 
is given in terms of Feynman diagrams by 


+ 


\ / \ / `e’ `a / 
. \ 7 \ / V 
AA ga san = e + @—@e + 
\ 
7 \ / \ Mi Sa r \ (2.16) 


Using the Feynman rules (2.15), this evaluates to 


22 1 1 1 
Annan = 64+ 4A v zt zF x]: (2.17) 
s — m% % u= m} 

Of particular interest is the behavior of this amplitude at low energies. Given that 
the NG boson z becomes massless in the limit € —> 0, we can certainly find a 
kinematical regime such that s, t, u <« m? for sufficiently small w. It then makes 
sense to expand the amplitude in powers of the Mandelstam variables. Upon a bit of 
manipulation using (2.7) and (2.12), one finds 


2 +t? +u?) OW? t, u°). (2.18) 


Annan = 


4w / > w 412v? 
4 (m ) g 6 
mu 2v my 

Interestingly, the leading term vanishes in the limit € — 0. But that is not all. In 
the same limit, we find that s = 2p; - p2 = 2p3- p4, t = —2p1 - p3 = —2p2- pa 
and u = —2pı ;- p4 = —2p2 - p3. If we now take the additional limit in which 
the momentum of any of the four particles goes to zero, then all the Mandelstam 
variables, and thus the amplitude A; -.77, will vanish. This property is concealed 
in (2.17), where all the four individual contributions coming from the diagrams 
in (2.16) have a nonzero limit. The algebraic cancellation leading to the eventual 
vanishing of the amplitude requires an interplay of the cubic and quartic interactions 
in the Lagrangian. This seems to be too much of a coincidence. In fact, it is our first 
example of the Adler zero principle: scattering amplitudes of a NG boson vanish 
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in the limit that its momentum goes to zero. In plain terms, low-energy NG bosons 
interact weakly. 

I have just made a much stronger statement than what the single amplitude 
worked out so far would seem to justify. We need more examples to check whether 
the Adler zero principle actually holds. To keep things simple, I will from now 
on take the limit of exact axial symmetry, € = 0, in which the NG boson z is 
exactly massless. The next example we will look at is ym — xz scattering, with 
the amplitude 


“Wenn = Nn t pari + Bid : y i 
/ \ / \ ES - \ 
7 oe sn 


7 \ / ` \ (2.19) 
Here a straightforward application of the Feynman rules gives 
,2f1 3 1 

Ayn—oyn = 2A +4 v | -+ ee ie (2.20) 
S t—m x u 


Suppose that we again take the momentum of one of the NG bosons, p2 or p4, to 
zero. In this limit, s, u —> m? = 2m? whereas t —> 0. Using finally the fact that 
v = m/v, it is easy to see that the whole amplitude Aynx—> xyr Vanishes. 

We are starting to see a pattern. We have now checked two different scattering 
processes, which together involve all the scalar interaction vertices in (2.15) except 
for the x* one. The Adler zero principle seems to hold, but it emerges out of a 
nontrivial interplay of different interaction terms and a cancellation among different 
Feynman diagrams. 

As the last example, consider the scattering of a NG boson off a fermion, wa —> 
wa. The invariant amplitude for this process is 


/ \ N 
/ \ © = Fd 
r br we Oe 


/ \ z (2.21) 
The Feynman rules (2.15) now give us 
A Fal ( ~ta ro 
yr>yr = Ups u(pı 
E 2 pi+ptmy p,- p+my 
(2.22) 


2A v_ 
pA a(ps)u (pi) , 
t — m3 


2.2 Nambu-Goldstone Boson and Its Interactions 19 


where u(p1) and u(p3) are the usual plane-wave Dirac spinors with polarization 
indices suppressed for brevity. In order to further simplify the expression for the 
amplitude, we invert the Dirac propagators and use the Dirac equation for the Dirac 
spinors, (p; — my)u(pi) = U(p3)(p3 — my) = 0. Using finally the fact that 
s— m, = 2pı ; p2 and u — mi, = —2p2- p3, the amplitude (2.22) can be cast as 


J/2hg 
t—m 


g (2+ 


v 
= 2u u : 2.23 
Ayr>yr = g P) ae T z4(ps)u(pı). (2.23) 


x 


Juco 


This form is suitable for checking the limit p? —> 0. Should one prefer to take 
the momentum of the other NG boson, p4, to zero, it is possible to replace the 
expression in the large parentheses with p,/(p3- p4) + p4/(pi - p4). Either way, 
the amplitude does not go to zero in the limit of vanishing momentum of the NG 
boson, in contrast to the previously discussed scalar amplitudes. The problem lies 
in the first term in (2.23), which does not even have a well-defined limit when p2 or 
p4 is taken to zero. That is because the fermion propagators become singular in this 
limit. I will only be able to offer a proper discussion of this issue in Chap. 10. For 
the moment, it suffices to keep in mind that the Adler zero principle has exceptions. 


2.2.3 Nonlinear Parameterization 


The whole preceding discussion of scattering amplitudes in our toy model was based 
on the linear parameterization (2.9). However, as already pointed out, this is not the 
only choice. Let us try something else. In elementary complex calculus, one learns 
about two different representations of complex numbers: the linear (“Cartesian”) 
and exponential (“polar”). I will thus replace (2.9) with 


ellO+2(x)/v] 
V2 


With this parameterization, it is natural to think of x as the fluctuation of the 
magnitude of ¢, and of x as the fluctuation of its phase. 

We can now follow the same steps as in Sect. 2.2.1. Using the partition (2.10) 
of the Lagrangian, we find that the constant part “jac and the bilinear part -ilin 
remain unchanged. This follows from the fact that we are expanding around the 
same ground state as before, and that the parameterizations (2.9) and (2.24) agree to 
first order in x, 2. The scalar interaction Lagrangian is now different though,” 


ġa) = [v + x(x]. (2.24) 


x 2 
Zu = vx? i (2 + am? (2.25) 


? Remember that I have already set € = 0. Otherwise, the interaction Lagrangian would include 
additional terms proportional to w. This is in contrast to (2.13), which is correct for any € € C. 
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Likewise, the fermionic part of the Lagrangian (2.14) turns into 


— — 0 — 
Ly = Vid — myv - ax $ Syl ys (2.26) 


where I have redefined the fermion field as W = exp[—(i/2)(6 + m/v) ys]. 

Note that the NG field z now enters the Lagrangian only through its derivatives. 
This remarkable feature is a direct consequence of the exponential parameteriza- 
tion (2.24). Namely, since 7 (x) appears therein together with the constant phase 0, 
any constant field x must drop out of the Lagrangian as a consequence of the axial 
symmetry. Another way to look at this is that the axial transformation (2.2) acts on 
the fields x, x, Y via 


Xx>xX, T —> T — 2VEA , wow. (2.27) 


While x and yw are left intact by the axial transformation, z is shifted by a constant. 
Hence, there is no other way to construct a Lagrangian invariant under the axial 
symmetry than to forbid operators containing x without any derivatives. This is 
typical for NG bosons, and it gives us insight into the origin of the Adler zero 
principle. If the NG field only interacts via its derivatives, it should not be surprising 
that its interactions are suppressed at low energies. 

Let us check how this works explicitly. We will need the Feynman rules for the 
interaction vertices contained in (2.25) and (2.26), 


/ 
( / 2i ig 
= —6iùv , —e =--p-q, =-—, 
\ v V2 
1 \ 
-£ =ar. X =n. X E 
` (2.28) 


The notation is such that p, q are energy-momenta carried by the NG legs, oriented 
towards the vertex. A comparison with (2.15) shows that, unsurprisingly, the 
interaction vertices involving only x or yy remain unchanged. The vertices of 7 
now depend on energy-momentum though. 

Since there is no z+ vertex, the zm —> mz amplitude is now given by the last 
three diagrams in (2.16). A brief calculation leads to 


1 s? t? u? 
Ansan = 7 + 7 + 7 ; (2.29) 


2 = & _ 
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which is easily seen to be equivalent to our previous result (2.17) in the limit € > 0. 
Note, however, that in the exponential parameterization (2.24), the Adler zero 
property is manifest. Each of the three contributions to (2.29) vanishes separately 
when the momentum of any of the four particles is taken to zero. There is no need 
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for cancellation between different Feynman diagrams. This is a direct consequence 
of the fact that the NG field x is now derivatively coupled. An important lesson 
of this exercise is that while the scattering amplitude can be calculated in any 
parameterization of ġ, not all parameterizations are equal. Indeed, some make the 
physical properties of the amplitude manifest. 


I have glossed over the fact that the scattering amplitude A;z-.77, and in 
fact the whole S-matrix, is independent of the choice of parameterization 
of @ around its VEV. This may seem intuitively obvious. After all, it is 
common lore that physical predictions of a theory should not depend on 
arbitrary choices such as a reference frame or a coordinate system. Yet, 
the mathematical proof of such reparameterization invariance of the S- 
matrix is nontrivial [4,5]. Moreover, the independence on the choice of 
parameterization really only applies to physical observables. Quantities that 
are in principle unobservable, such as off-shell Green’s functions of fields, 
may differ in different parameterizations. Finally, verifying the reparameter- 
ization invariance of the S-matrix explicitly beyond the tree-level (classical) 
approximation may require taking carefully account of the Jacobian of the 
functional integral measure [6]. 


The next amplitude to check is that for the xm — xz process. In this case, the 
same four diagrams as in (2.19) contribute, although the individual graphs of course 
take different values in the two parameterizations (2.9) and (2.24). A straightforward 
application of the Feynman rules (2.28) leads to 


1 | i (s — m?) 3m? t (u — < (2.30) 


A m—>xn = t 
x x v2 S t-m? u 


which is equivalent to the previous result (2.20). In its present form, the amplitude 
however vanishes manifestly, diagram by diagram, in the limit where the momentum 
of one of the NG bosons is taken to zero. 

The last amplitude we are interested in is for wa — wa. Again the same 
diagrams as in (2.21) contribute, but again they take individually different values 
than before due to the different Feynman rules (2.28). The result can be written 
in a number of different forms; one that is not too far from a direct application of 
Feynman rules is 


1_ (pi — my) (p — my) 
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This time it takes more effort to prove equivalence with our previous result (2.23). 
I will spare the reader of the details, making use of the Dirac equation and the 
properties of the Dirac y-matrices. What matters is that the contribution of every 
single diagram now carries a factor of p2 and p4 in the numerator, coming from 
the derivative couplings of the NG field x. Yet, this is not sufficient to make the 
amplitude vanish if either p2 or p4 goes to zero. 


2.3 Low-Energy Effective Field Theory 


The spectrum of our model (2.1) contains the massless NG boson zr, the Higgs mode 
Xx with mass my = m4/2, and the Dirac fermion y with mass My = gu//2 = 
mg//2k. At energies well below the mass scales of m x and my, these massive 
modes will not be excited. As a consequence, the physics of the toy model will 
reduce to that of the NG boson x. Yet, in both parameterizations (2.9) and (2.24), 
the Higgs field is needed even at low energies since it mediates interactions between 
NG bosons. In fact, in the exponential parameterization (2.24), the Lagrangian does 
not contain any direct self-interaction of x at all. This sounds like an overkill. We 
should not need any other fields just to describe self-interactions of the NG bosons. 
In the spirit of EFT, introduced in Chap. 1, we should be able to describe the low- 
energy physics of NG bosons using the z field alone. 

Such a low-energy EFT should respect the symmetries of the underlying theory 
defined by (2.1). With the transformation rules (2.27), this does not place any 
constraints on x, Y, which can thus be safely dropped. The shift transformation 
of zr, on the other hand, forbids operators without derivatives on z. It follows that 
the effective Lagrangian for the NG boson must be described by some, a priori 
unknown, function of derivatives of m. 

We also observed above that derivatives in the interaction vertices suppress 
their contribution to the S-matrix at low energies. More derivatives imply stronger 
suppression. This suggests an organization principle, whereby operators (with the 
same number of z fields) are hierarchically ordered according to the number of 
derivatives they contain. The dominant contributions to the S-matrix will come from 
operators with the fewest derivatives possible. Since each field x must still carry 
at least one derivative, the EFT will be dominated by interaction operators with 
exactly one derivative per each x. Lorentz invariance then constrains the effective 
Lagrangian to 


1 CO 
Lor = sum) +) clu T +o, (2.32) 


n=2 


where the ellipsis stands for operators with more than one derivative per x. 

The as yet undetermined couplings c2, govern the low-energy properties of 
scattering amplitudes of NG bosons. It remains to find out what these couplings are. 
In a concrete physical system, this could be done by performing a set of scattering 
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experiments. It is however also possible to fix the values of cz, theoretically from 
the underlying model (2.1). I will outline two strategies for doing so. 


2.3.1 Matching 


One possibility is to evaluate a set of scattering amplitudes, or other observables, 
in both the underlying model (2.1) and its low-energy EFT (2.32). Comparing the 
predictions allows one to fix cz, in terms of the parameters m, A, g. In the jargon of 
EFT, this is called matching. 

Let us see how it works in practice on the example of the 72 — mm scattering. 
Within the EFT, this is described by the c4 coupling. There is a single Feynman 
diagram, namely the first diagram in (2.16). The corresponding invariant amplitude 
reads, in terms of the Mandelstam variables, Az7_+77 = —2c4(s* +t? + u’). Note 
how the dependence on the particle momenta exactly copies the leading contribution 
to the previously calculated amplitude (2.18) in the limit of exact axial symmetry. 
Upon comparing the coefficients of the kinematical invariant s? + t? + u?, we find 


(2.33) 


The same procedure could in principle be followed to determine c6, cg and so 
on. That would however be a tedious task, since the number of Feynman diagrams 
contributing to an n-particle process grows rapidly with n. Sometimes, an alternative 
approach is feasible whereby the EFT is deduced from the underlying theory directly 
at the level of the Lagrangian. I will now demonstrate how to do this in the case of 
our toy model. 


A cautious reader might be wondering why I have made no mention of 
Feynman diagrams containing loops in all the discussion above. Within the 
toy model (2.1), restricting to tree level amounts to considering the leading 
contribution to scattering amplitudes in a power expansion in the small 
couplings À, g. The result (2.33) should therefore likewise be interpreted as 
the leading contribution to c4, induced by the presence of the heavy modes x 
and y. See Sect. 2.3 of [7] for more details. 


2.3.2 Eliminating the Heavy Modes 


In the exponential parameterization (2.24), all tree-level scattering amplitudes of NG 
bosons in the model (2.1) arise from Feynman diagrams that include virtual x modes 
in the propagators. Recall now that quantum field theory at tree level is equivalent 
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to its classical limit. Including the effect of virtual x quanta is then equivalent to 
solving the classical equation of motion (EoM) for x and inserting the result back 
to the Lagrangian. In this way, one can obtain an EFT for x alone that takes into 
account all the interactions of the original model (2.1). 

Let us see this program through to its end. We start by dropping the fermion 
field and putting the scalar interactions (2.25) together with the kinetic term into a 
complete scalar Lagrangian, 


1 À 1 x 2 
Z = (ux) — mx? hog? Ge tas) (8,7) . (2.34) 


The parameters m, A, v are related by v = m/v. The EoM for x, which we would 
like to solve for x in terms of x, reads 


1 
x +2m?x +3awx? +43 — (1+ 4) (a,x)? =0. (2.35) 
v v 


It is of course not possible to solve this equation in a closed form, we can however 
still extract useful information from it. At low energies, we expect x to be very 
small. (The probability to excite virtual x quanta far away from their mass shell 
is tiny.) In the first approximation, the terms in (2.35) quadratic and cubic in x 
can therefore be neglected. We can likewise drop the Lx term, suppressed by two 
derivatives. Equation (2.35) then becomes a linear algebraic equation for x with 
the solution x ~ (0,7)? /(2m7v). In order to go beyond this approximation, we 
rewrite (2.35) in a form suitable for iteration, 


1 
(ður). (2.36) 


1 2 1 2 2 
X = -| 2m +0- (ður) +3vx +AX 
v v 


All terms but 2m? in the square brackets are small due to containing either 
derivatives or extra factors of x. This makes it possible to iteratively expand (2.36) 
in inverse powers of m? up to any desired order in x and its derivatives. Up to and 
including the first subleading contribution to x, we thus get 


— ux)? Oury 


~ 2m?v 8m4v? (an 


When inserted back into (2.34), this is enough to generate the couplings c4 and c6 
in the Lagrangian (2.32). When the dust settles, we find c4 = 2/(4m*) and c6 = 0. 
The result for c4 agrees with our previous calculation relying on direct matching of 
scattering amplitudes. It may however come as a surprise that ce vanishes. 
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To get further insight, we multiply the EoM (2.35) by x/2 and add to the 
Lagrangian (2.34), giving 


AV 3 A 4, 1 x 2 
re +5 (1+) @x) l (2.38) 


The symbol ~ indicates that I have dropped a surface term. The obtained Lagrangian 
can be further manipulated by rewriting the EoM (2.35) as 


(ur)? xX 
À 2v) = -—, 
x(x + 2v) 2 rarer 


(2.39) 


Using this twice in sequence, it is possible to remove from (2.38) all terms that only 
contain x without any derivatives. One thus arrives at an equivalent Lagrangian, 


at etd ge) erp. | Oe 2] 
2 x Our + alum] | a +e]. (2.40) 


The first two terms here are just the kinetic term for x and the c4 coupling. The 
last term, upon expansion in 7 using (2.36), contains only operators with more than 
one derivative per x. This allows one to make the strong conclusion that all the 
couplings in the effective Lagrangian (2.32) except for c4 vanish. 


2.4 Moral Lessons 


In this introductory chapter, we have analyzed a simple toy model, introducing along 
the way some concepts that will play a key role throughout the rest of the book. Let 
me briefly summarize what we have found, and draw a few morals. 


Lesson #1 Some physical systems have multiple degenerate ground states. Barring 
accidental degeneracy, this happens typically when the system possesses a symme- 
try under which the ground states are not invariant. This is the defining property of 
SSB. 


Lesson #2 When the spontaneously broken symmetry is continuous, the spectrum 
of the system contains a massless particle: the NG boson. This is Goldstone’s 
theorem. 


Lesson #3 The scattering amplitude for a process involving a NG boson vanishes in 
the limit where its momentum is taken to zero. That is, NG bosons interact weakly 
at low energy. This is the Adler zero principle. The principle may be violated in case 
taking the NG boson momentum to zero brings some of the virtual particles in the 
process on the mass shell. 
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Lesson #4 It is convenient to choose a nonlinear field parameterization in which 
a constant NG field can be eliminated by a symmetry transformation. In such 
a parameterization, the NG field is derivatively coupled, making its physical 
properties of zero mass and weak interactions at low energy manifest. 


Lesson #5 The nonlinear field parameterization allows one to construct a low- 
energy EFT in terms of the NG field alone. This captures the physics of NG bosons 
at energies well below the mass scale of other, massive particles present in the 
system. 
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In the previous chapter, I worked out in detail a model of a single complex scalar 
field, in which a continuous symmetry is spontaneously broken. The corresponding 
axial transformations span the Abelian group U(1). Due to its simplicity, such a 
model is suitable for a first encounter with spontaneous symmetry breaking (SSB). 
However, it falls short of exhibiting the full spectrum of possible realizations of 
SSB. In this chapter, I will therefore generalize the model in two aspects. First, 
I will allow for multiple scalar fields carrying a nontrivial representation of a 
possibly non-Abelian symmetry group. This will take us to the level of standard 
expositions of SSB in textbooks on quantum field theory, oriented towards high- 
energy physics. Second, I will pay off my debt to readers with other backgrounds 
and show how the story changes if one gives up relativistic (Lorentz) invariance. 
This is particularly relevant for applications to condensed-matter physics, but also 
concerns dense relativistic matter that one deals with, for instance, in astrophysics 
and cosmology. 

The primary goal of this chapter is to introduce the reader to the intricate 
interplay between the pattern of SSB and the spectrum of the associated Nambu— 
Goldstone (NG) bosons. This goes a long way towards a broad qualitative under- 
standing of the behavior of physical systems with SSB at low energies. Namely, as 
explained in the previous chapter, NG bosons tend to interact weakly. In the absence 
of other massless particles, the low-temperature thermodynamics of the system will 
then be accurately described in terms of a free gas of NG bosons. The latter is 
completely characterized by the number of NG bosons in the spectrum and their 
dispersion relations. 

Within the landscape of relativistic field theory, a NG boson is always a massless 
particle whose dispersion relation is fixed by Lorentz invariance. Section 3.1 
illustrates how, in this case, the number of NG bosons can be determined solely 
from group theory. As soon as we give up Lorentz invariance, however, interesting 
things start to happen. Importantly, the NG state in the spectrum requires more data 
to specify than mere rest mass. Section 3.2 shows that the asymptotic behavior of 
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the dispersion relation of NG bosons at low momentum is intimately related to their 
number. Within this chapter, I will not discuss interactions of NG bosons any further, 
since there are no qualitative changes compared to the toy model of Chap. 2. 


3.1 Relativistic Models with Non-Abelian Symmetry 


Suppose we are given a set of real scalar fields, #'. Similarly to the previous chapter, 
I will introduce at once a class of toy model Lagrangians with a fixed kinetic term 
and a generic potential, 


1 ; s 
L = 81nd! dpi — VQ). 68.1) 


Here V (¢) is an a priori arbitrary function of ¢', only restricted by the requirements 
that it is bounded from below and its Taylor expansion around ¢' = 0 starts at the 
quadratic order. Since any complex scalar field can always be represented by two 
real ones, the class of Lagrangians (3.1) includes our original toy model (2.1). I 
have however dropped the fermion sector of (2.1). I have also set to zero the terms 
proportional to € and e*, as they only serve to select a unique ground state. With 
the experience gathered in Chap. 2, the reader should by now be comfortable with 
degenerate ground states. 

The symmetry of the theory defined by (3.1) will play a key role throughout 
this chapter. I will assume that the fields ¢! carry a linear representation of some 
Lie group G. Invariance of the whole Lagrangian (3.1) under transformations from 
G requires that the kinetic term and the potential are invariant separately. Let the 
total number of scalar fields be n. Then the kinetic term alone is invariant under 
continuous rotations of ¢', forming the orthogonal group SO(n). We therefore 
expect that G is a subgroup of SO(n), depending on the concrete choice of potential. 
With this in mind, I will focus in the following on the symmetry of the potential 
V (¢) itself. 

Suppose that the fields ¢' transform in some (real) representation R of G. The 
invariance of the potential V(@) under G is expressed by the condition 


V(R(g)¢) = Vo) foranygeG. (3.2) 


For continuous symmetries, it is usually more convenient to work with infinitesimal 
transformations. Denoting the generators of G as Q,4 z,..., the condition (3.2) then 
boils down to 


av mp 
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where g is the Lie algebra of G. This will be the starting point for the analysis of the 
spectrum of NG bosons below. 


Example 3.1 


It is straightforward to construct a potential that is invariant under the complete 
group of rotations, SO(n). Indeed, any function of the quadratic invariant, 6? = 
Sij ¢/, will do. For instance, a generic SO(n)-invariant quartic polynomial 
potential will take the form V(¢) = =m’? + 2+, which generalizes (2.5) from 
n = 2 to any n. The fields ¢! transform under the vector representation of SO(n), in 
which R(g) are n x n orthogonal matrices. The generators of SO(n) are labeled by 
a pair of vector indices; up to overall normalization, R(Ox1)! 7 equals 516 jl 5/5 jk. 


3.1.1 Spectrum of Nambu-Goldstone Bosons 


Let us choose a potential V(@) so that there is a ground state in which some of 
the fields ø’ have a nonzero vacuum expectation value (VEV), (¢'). The elements 
of G that leave the VEVs unchanged form a group H, referred to as the unbroken 
subgroup of G, 


H = {h € GIR oi) = (')}- (3.4) 


The generators of H will be labeled as Q«,g,.... As an immediate consequence 
of (3.4), such unbroken generators satisfy 


R(Qa)' (pi) =0 forany Qa €b, (3.5) 


where b is the Lie algebra of H . In many physical systems, H is a proper subgroup of 
G. That is, there are transformations from G that do change some of the VEVs (¢’), 
hence also the ground state itself. Representing a symmetry of the Lagrangian (3.1) 
but not of the ground state, these are said to be spontaneously broken. 

By definition of symmetry, acting with an element of G on any field configuration 
gives a (possibly different) field configuration with the same energy. Hence, as 
already observed in the previous chapter, the existence of symmetry transformations 
that do not leave the ground state invariant implies the existence of degenerate 
ground states. We also previously found that this is closely related to the presence 
of massless particles in the spectrum. 

Let us see how the NG bosons emerge in the present general setting. We start by 
choosing a basis {Q«, Qa} of the Lie algebra g such that {Qg} is a basis of § and 
Qa ¢ b, that is R(Qa) (p/) Æ 0. Next, we take a derivative of (3.3) with respect to 
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H. Upon some relabeling of indices, this becomes 


a°V(¢) a 
agiagi * (2a) 19" + 


aV() 


joi R04), = 0; (3.6) 


Finally, we evaluate this condition in the chosen ground state and use the fact that 
by definition, the first derivatives of V(@) in the ground state vanish, 


3 V ($) 
dp dpi y= 


RQA) (6*) =0. (3.7) 
) 


This does not say anything about the unbroken generators Qg, for which the left- 
hand side identically vanishes. However, for the broken generators Q4, this implies 
that R(Q,)! j (p/) is an eigenvector of the Hessian matrix of V (#) in the ground state 
with zero eigenvalue (zero mode). Moreover, the set of eigenvectors R(Q,)' j (o?) 
with all the different choices of Q4 is linearly independent. To see why, suppose that 
there are coefficients c} such that c*R(Oa)' (o?) = 0. Then by (3.5), c° Qa € 5b. 
But we have assumed that the set {Q,, Qa} is linearly independent, hence cg = 0. 

This proves that for every broken symmetry generator Q4, the Hessian matrix of 
V(@) in the ground state has a corresponding independent zero mode. Now recall 
that with a canonically normalized kinetic term as in (3.1), the Hessian of V(@) 
represents the mass matrix of the theory. That is, its eigenvalues give the squared 
masses of elementary one-particle excitations in the spectrum. Thus, there is one 
NG boson in the spectrum for each broken symmetry generator. The total number 
of NG bosons in the toy model (3.1) is dim G — dim H. 


What I have just presented is (a simplified version of) one of the original 
proofs of Goldstone’s theorem [1]. It is worth stressing that the number 
of zero modes of the Hessian matrix of the potential may be larger than 
dim G — dim H. First, I have not proven that the Hessian cannot have zero 
modes entirely unrelated to symmetry. (It can.) Second, it is possible that 
the symmetry group of the potential V(@) alone is larger than that of the 
kinetic term, and thus of the whole Lagrangian (3.1). In both cases, a classical 
analysis as worked out in this and the preceding chapter may yield “fake” 
NG bosons. These are scalar fields that have no classical mass term in the 
Lagrangian, but acquire mass solely due to quantum corrections [2,3]. As 
I will explain in Chap. 6, the vanishing of the mass of true NG bosons is 
guaranteed by SSB. This is an exact result that must also hold within any 
approximation that respects the symmetry of the given theory. 

Let me stress that everything said so far relies on the kinetic term in the 
Lagrangian being a positive-definite quadratic form in the time derivatives 
of ¢'. Finding the mass spectrum of a scalar theory such as (3.1) is then 


(continued) 
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equivalent to finding the normal modes using the theory of small oscillations 
of classical mechanical systems [4]. Section 3.2 revolves largely around the 
subtleties that arise when this requirement is not satisfied, which may happen 
once we depart from the landscape of Lorentz-invariant field theory. 


Example 3.2 


Consider a model of an n-plet of scalar fields øt whose Lagrangian is invariant under 
the group G ~ SO(n). Any ground state in which at least one of the VEVs (@') is 
nonzero will be invariant under the subgroup H ~ SO(n — 1) of transformations 
that leave (g') fixed. In other words, H consists of field rotations in the n — 1 
directions orthogonal to (¢'). By the general argument developed above, we expect 
the spectrum of such a model to contain dim SO(n) — dim SO(n — 1) =n — 1 NG 
bosons. Note that the special case of n = 2 correctly recovers the single NG boson 
we found in Chap. 2. 


3.1.2 Low-Energy Effective Field Theory 


On general grounds, we expect the dynamics of NG bosons to be captured by a low- 
energy effective field theory (EFT). In Sect. 2.3.2, I showed how to derive such an 
EFT explicitly from the underlying Lagrangian. Here we do not know the precise 
form of the potential V (@) or its symmetry. It is however still possible to gain useful 
insight by following, if only schematically, the same argument as in Sect. 2.3.2. 

The first step towards the EFT for NG bosons is a suitable choice of parameteri- 
zation of the fields g'. The exponential parameterization (2.24) for a single complex 
field can be generalized to the set of real fields ¢! as 


g(x) = U', a) [(b/) + x/@)] . (3.8) 


Here U (zr) is a matrix taking values in the representation R of G. It encodes a set of 
NG fields 2“, one for each broken generator Qa. One can for instance choose the 
generators as real antisymmetric matrices and imagine that U (sr) is the orthogonal 
matrix exp[iz“R(Q,)]. But a precise form of U (zr) is not important. All that I will 
need is that when expanded in powers of 7“, the leading term is U(O) = 1 and the 
linear term is proportional to 7“R(Qz,). Finally, xi encodes a set of Higgs modes 
whose masses are expected to be nonzero. 
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In spite of the suggestive notation, the different components of x! are not 
linearly independent. By mere counting of degrees of freedom, the number 
of independent Higgs fields should be n — dimG + dim H. What exactly 
these independent linear combinations of x! are, can be identified using 
group theory. One just has to decompose the space of ø’ into irreducible 
representations of H and drop the representations corresponding to the NG 
bosons. 


The advantage of the parameterization (3.8) is that the matrix U (zr) drops out of 
the potential V(@), because the latter is by construction G-invariant, cf. (3.2). The 
fact that U (zr) depends implicitly on spacetime coordinates through the fields 1“ (x) 
does not need to bother us since the potential does not contain any derivatives. In 
this parameterization, the Lagrangian (3.1) will therefore have a similar structure 
to that in (2.34). The NG fields x4 will only enter through the kinetic term, that is 
together with derivatives. The nonderivative potential, on the other hand, will only 
depend on the Higgs fields x’. Using the shorthand notation @, x for the vectors 
¢', x’, the Lagrangian reads explicitly 


1 
Z= 59x OX + Aux UTU - A) + x) 


1 
+ 5b) + xX) UT IHU» UH) +) = VO) - (3.9) 


We would now like, at least in principle, to eliminate x! by using its equation of 
motion (EoM). It is important to make sure that the expansion of x! in powers of 2“ 
generalizing (2.37) starts with a term with (at least) two derivatives and two powers 
of x“. This in turn requires that the Lagrangian (3.9) does not contain any mixing 
term, linear in both ¢! and x’. The vanishing of such mixing terms can be achieved 
by choosing the vector x to be orthogonal to all R(Qzq) - ($). 

The rest of the argument is simple. Inserting the solution for x! in the 
Lagrangian (3.9) gives terms that contain at least four derivatives. The effective 
Lagrangian for the NG fields will however be dominated by operators with only 
two derivatives, since a higher number of derivatives means stronger suppression at 
lower energies. Such two-derivative terms are obtained by simply setting x’ > 0 
in (3.9), 


Logg = (Ø) 3.07 aU - (9) +e, (3.10) 


where the ellipsis stands for contributions with more than two derivatives. Remark- 
ably, this leading contribution to the EFT arises solely from the kinetic term in (3.1). 
The concrete choice of potential V (ġ) does not matter. All we need to know are the 
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VEVs (¢’) and the ensuing symmetry-breaking pattern G —> H. These together fix 
the matrices U (zr) up to a choice of parameterization in terms of the NG fields zr“. 


Example 3.3 


Let me illustrate this on the case of spontaneous breaking SO(n) —> SO(n — 1) 
by an n-vector field @, introduced in Example 3.2. Here U (x) takes values in the 
defining (vector) representation of SO(n). Thus, U (x) - (@) is a vector of the same 
length as (@). By a suitable choice of units, we can make this length whatever we 
want. It is conventional to set U (x) - (@) = vn(z), where n is a unit vector field 
and v is a dimensionful constant. Then the effective Lagrangian (3.10) acquires the 
form 


2 
Lert = Fn: dnt (3.11) 


which is known as the nonlinear sigma model. Note that in spite of the appearance, 
this is not a noninteracting field theory. The unit vector field n takes values from 
the unit (n — 1)-sphere, S”~!. The NG fields 2“ can be thought of as n — 1 
independent coordinates on the sphere. Once a choice of coordinates is made and 
the Lagrangian (3.11) is expanded in powers of x“, it is going to contain operators 
with two derivatives and an arbitrarily high number of NG fields. 


This example underlines an important distinction between the broader class of 
theories (3.1) and the special case analyzed in Chap. 2, where the symmetry group 
G ~ U(1) was Abelian. In the latter case, the leading contribution to the effective 
Lagrangian contains operators with exactly one derivative on each NG field x. The 
part of the Lagrangian with two derivatives is then just the kinetic term, and any 
interactions necessarily come with four or more derivatives. Once G is allowed to be 
non-Abelian, the two-derivative part of the Lagrangian (3.10) contains interactions 
bringing together an arbitrarily high number of NG bosons. 

Equation (3.10) gives us the first hint that the form of the EFT for NG bosons 
is controlled by symmetry, regardless of the details of the underlying model. The 
Lagrangian can be cast solely in terms of the NG fields 7“, 


1 
Lett = 5 8ab (apr Or” feoda 


IU (x)! IU (x) 
one an? 


Sab(T) = ($) 


($). (3.12) 


I will show in Chap. 8 that the matrix function gap(7r) can be interpreted in terms 
of the geometry of the Lie groups G and H. The dependence of gap(zr) on the NG 
fields is fixed by this geometry. All that is left of the concrete model leading to the 
EFT (3.12) is the numerical value of the constant matrix ggp(0). 
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3.2 Going Nonrelativistic 


In all the examples worked out in the previous and this chapter so far, I restricted 
myself to relativistic, Lorentz-invariant field theories. This choice was based on 
technical simplicity. The notation using Lorentz indices is more compact, and 
the calculations of scattering amplitudes in terms of the relativistic Mandelstam 
variables are more transparent. There are good reasons to step out of the box, 
though. First, the great majority of natural phenomena occurs at energies well 
below the scale at which relativity starts to matter. In the spirit of EFT, such 
phenomena should therefore have an accurate description in terms of nonrelativistic 
field theory. Second, as we shall see in this section, relaxing the requirement of 
Lorentz invariance leads to rich phenomenology. The examples developed below 
will prepare the reader for the more thorough discussion of SSB that comes in Part II 
of the book. 

Operationally, what I will do is to modify the previously discussed toy models 
in a way that respects invariance under spatial rotations, and spatial and temporal 
translations. While this does not place any restrictions on the potential V(@), the 
kinetic term may now assume a more general form. The Lagrangian may then 
contain independent terms with either two spatial derivatives, or one or two temporal 
derivatives. The very possibility of adding terms with a single time derivative 
constitutes the main qualitative difference compared to relativistic field theory. 


3.2.1 Single Schrödinger Field 


In order to copy as closely as possible the previous discussion of relativistic field 
theory, let us start with the following model, 


L =2iMd* db — VO* -Vo +m o*b — App. (3.13) 


Here ¢ is a complex scalar field and do stands for a time derivative. This Lagrangian 
is almost identical to the scalar sector of (2.1), except for the temporal part of the 
kinetic term. For m = A = 0, the corresponding EoM is the Schrödinger equation 
for a free particle of mass M. It is therefore natural to refer to the field @ as a 
“Schrödinger field.” 

The analysis of the model (3.13) now follows the same steps as in Chap. 2. The 
classical Hamiltonian density reads 


KH =Nb*-Vo—m' $b +A. (3.14) 
The lowest energy is achieved by a coordinate-independent state such that 


(3.15) 


3.2 Going Nonrelativistic 35 


where 0 is an arbitrary phase, reflecting the ground state degeneracy. Making use of 
the exponential parameterization (2.24) brings the Lagrangian to the form 


L = Lyre t+ Litin + Gat - (3.16) 


The vacuum piece equals yac = (1/ 2)m? v? — (A /4)v* = m* /(4A). The bilinear 
and interaction parts of the Lagrangian read, respectively, 


1 1 
pilin = —2M x do rue eg mx? NY: 


à M 2 
Zin = —Avx? — 7x" -xX don (4+25) cm. (3.17) 


where I have discarded some contributions that amount to a total time derivative. 

Just like for the relativistic model (2.1), the NG field x enters the Lagrangian 
only with derivatives. This is an immediate consequence of the choice of parame- 
terization (2.24). Here is where the similarity ends, though. It would be a mistake to 
conclude that x represents a NG boson and x describes a massive, Higgs mode. The 
two fields are mixed by the bilinear term —2M x doz, which makes x, 7x canonically 
conjugated to each other. We therefore expect that the spectrum of the model (3.13) 
contains just one type of excitation, which should then be a NG boson. 

In order to describe this excitation more accurately, let us rewrite the bilinear part 
of the Lagrangian in a matrix form, 


1 V? — 2m? —2Ma\ (x 
Shilin X 5 (x n)( 2M a v2 ) (*) . (3.18) 


The symbol ~ again indicates that I have dropped a surface term. What is entirely 
new compared to relativistic field theory is that the bilinear Lagrangian cannot be 
diagonalized by any local field redefinition. This is just a minor nuisance when 
it comes to finding the spectrum of one-particle states. It is however a major 
complication if one wants to study scattering of the particles. Calculating a cross- 
section for a scattering process now requires a careful mapping of one-particle states 
to fields in the Lagrangian, and a revision of rules for phase space integration. Some 
discussion of these issues can be found for instance in [5—7]. 

In order to find the dispersion relation of the NG mode, we Fourier-transform 
and set the determinant of the matrix in (3.18) to zero. This gives the energy E of 
the NG boson as a function of its momentum p, 


E(p) = Pl fp + 2m? (3.19) 


Note that in the limit m — 0, this recovers the conventional dispersion relation of a 
nonrelativistic particle, E(p) = p*/(2M). 
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Once the dispersion relation is no longer fixed by Lorentz invariance and the 
rest mass, it makes little sense to refer to a NG boson as a massless particle. 
Instead, it is common to say that the NG boson is a gapless mode or excitation. 
This is however a bit of a misnomer. The gap in the excitation spectrum is 
defined as minp E(p). On the other hand, the characteristic property of NG 
bosons is, as I will explain in detail in Chap. 6, that 


lim E(p)=0. (3.20) 
p-0 


Since the excitation energy above a stable ground state is by definition 
positive, a NG boson is always gapless. However, a gapless excitation may 
not necessarily be a NG boson. There are physical systems where the 
dispersion relation of a one-particle excitation develops a local minimum due 
to dynamical effects. This is the case for instance for the “roton” mode in 
superfluid helium [8]. I am however not aware of any example where the 
energy of the excitation at the local minimum could be tuned to zero without 
breaking some symmetry. 


One might hope that it is possible to get rid of the annoying mixing between the 
x,7 fields by eliminating the Higgs field x as in Sect. 2.3.2. Let us see how the 
derivation of the low-energy EFT for z alone goes. We start by writing the bilinear 
and interaction parts of the Lagrangian (3.17) together as 


1 X 1 X N2 
Z ~ —-(Vy)? = mx? — Avy? 4 1 ) z, 3.21 
5 (Vx)? — mx? — doy? — Sx4— 5 (14-2 3.21) 


up to terms that are a total time derivative. I have introduced a shorthand notation, 
E =2Mvdon + (Vr). (3.22) 


Note that the Lagrangian (3.21) is nearly identical to (2.34) except for the replace- 
ment (ð, x)" > -(V x) and a different identification of &; in (2.34) it is simply 
—(Ou7). We can therefore follow the same steps as in Sect. 2.3.2 to eliminate x. 
All we have to do is to take the results thereof and substitute (0,70)? — —&E and 
—> —V? wherever appropriate. Thus, an iterative solution of the EoM for x is 
obtained at once from (2.36), 


fen] 
t= 
2 


1 —1 
x= (2m? V? + +3awx + 222) © 
v 


v 
vT 
(: z) 5,2, FOE): (3.23) 


G) 
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By using this in (2.40), we then readily determine the first two terms of the effective 
Lagrangian expanded in powers of ©, 


z z y2! 
Loe ~ > (: >) a +0(2°). (3.24) 


4m2v2 2m 

While restricted to the lowest orders in & and hence x, this Lagrangian still 

contains terms with an arbitrarily high number of derivatives. The price for having 

eliminated the kinetic mixing between x and z therefore is an introduction of 

nonlocal operators. This is necessary to get the dispersion relation of the NG boson 
right. Indeed, by substituting from (3.22), we find 


do 
Aa 2 


1 
= 5 (Vx) + O(n). (3.25) 
Fourier transformation to the energy-momentum space then recovers our previous 
result (3.19). 


3.2.2 Multiple Nambu-Goldstone Fields 


We would of course like to understand also what happens in models that contain 
multiple nonrelativistic scalar fields. Are there any further surprises awaiting us? 
Instead of developing a general picture akin to the relativistic framework of 
Sect. 3.1, I will focus here on some instructive examples. A more complete analysis 
will follow in Chap. 8 using the powerful machinery of EFT. 

Let us first contemplate what we expect to find. Consider a model with multiple 
(real) scalar fields œ! and a generic potential V(@) as in (3.1). The argument of 
Sect. 3.1.1 then goes through without change. The conclusion that there is a well- 
defined injective mapping from broken symmetry generators to zero modes of the 
Hessian matrix of V(@) in the ground state remains. One can say that the number 
of linearly independent NG fields still equals dim G — dim H, in one-to-one corre- 
spondence with the broken generators. However, with a generalized, nonrelativistic 
kinetic term, we are no longer guaranteed the existence of an independent NG mode 
for each such NG field. In presence of terms in the Lagrangian with a single time 
derivative, some of the NG fields may be canonically conjugated to each other. We 
then expect one NG mode in the spectrum to be associated with a pair of NG fields. 

Let me conclude this chapter and the whole Part I of the book with a couple of 
illustrative examples. 


Example 3.4 


Consider the limit m = A = 0 of (3.13), already briefly mentioned previously, 


L = Mp dvp —VO*-Vo. (3.26) 
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Here the potential is vanishing, hence the real and imaginary parts of @ are both 
trivially zero modes. There is a single one-particle excitation in the spectrum 
with dispersion relation E(p) = p*/(2M). This is a theory of a noninteracting 
Schrödinger field, which seems to be rather uninspiring. Things get more interesting, 
though, once we look at the theory from the point of view of symmetry. 

There is a U(1) œ~ SO(2) symmetry under which the field transforms as 
$ — e€¢. The natural choice of ground state with (p) = 0 leaves this symmetry 
unbroken. However, the free Schrödinger theory (3.26) has another symmetry, 
namely øo —> ¢ + €; + i€2, where both €1,2 are real parameters. This shifts the 
Lagrangian by a total time derivative and thus leaves the action invariant. Such a 
shift symmetry is necessarily spontaneously broken no matter how the VEV (@) 
is chosen. At the end of the day, we have two spontaneously broken generators 
but only one NG mode with a quadratic dispersion relation. In some aspects, 
the free Schrödinger theory (3.26) constitutes a rather nontrivial realization of 
a nonrelativistic NG boson. A detailed discussion of SSB in the quantized free 
Schrödinger theory is offered in Sect. 5.3. 


For an example of an interacting field theory, replace the single complex field œ 
in (3.13) with a two-component complex field (doublet) ®, 


Z = Mb dt — Vİ. Vo +m o'o— a(d p). (3.27) 


Here the dagger indicates Hermitian conjugation. This Lagrangian possesses a U(2) 
symmetry. It has an Abelian subgroup, U(1), under which the doublet ® changes 
its overall phase, ® —> e!@. What is new is the non-Abelian SU(2) subgroup of 
U(2) under which P > R(g)®, where g € SU(2) and R stands for the fundamental 
representation thereof. 

The analysis of the model (3.27) proceeds as before. The classical Hamiltonian 
is minimized by any constant field ® such that (®'®) = v*/2 = m?/(2d). 
Geometrically, the set of all degenerate ground states corresponds to a 3-sphere, S°. 
This is easy to see if one thinks of ® as a four-component real vector whose length 
is fixed by minimization of the Hamiltonian. While any of the different degenerate 
ground states is equally good, the conventional choice that simplifies notation is! 


v (0 
= — . 2 
(®) A (‘) (3.28) 


Out of all the generators of U(2), the only one (up to normalization) that leaves this 
VEV unchanged is 1 + 13, where t3 is the third Pauli matrix. The remaining three 


1 Any other choice of ground state can be obtained from our (®) by multiplication with a constant 
U(2) matrix. This generalizes the arbitrary phase e}? in (3.15). 
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linearly independent generators are spontaneously broken. We therefore expect three 
of the four degrees of freedom contained in ® to represent NG fields. 

The low-energy spectrum of the model (3.27) is most easily addressed within an 
EFT for the three NG fields. To that end, we need to choose a parameterization for 
®. Inspired by (3.8), let us set 


1 
V2 


where U (zr) is a unitary matrix encoding the three NG degrees of freedom and x is 
a (real) Higgs field. A straightforward manipulation then leads to a Lagrangian of 
the type (3.21), except that we now have to set 


(x) = vaw ( n k (3.29) 


v+ x(x) 


E = —2iMv’ (UDU) + v (VUT - VU) (3.30) 
The leading contributions to the effective Lagrangian are again given by (3.24). To 


complete the analysis, we parameterize U (7x) in terms of a triplet of NG fields, 
x(x), 


U(r) = exp (=: . x) R (3.31) 


Up to second order in the NG fields, which is needed to pin down the bilinear part 
of the effective Lagrangian, one finds 


E = —2Mvdon? — 2M (a x dom)? + bap Vr" Vr? +0?) . (3.32) 
Upon using this in (3.24), one thus obtains 


do 


as 2 3 
L ~ 2M don a2 ya” 


1 
3 say 

1 1 
+ M(x! don? — mar!) — se) = VY + O(r) . (3.33) 


The dispersion relation of 2° is the same as for the model (3.13) with a single 
complex field, namely (3.19). The z!,2* sector is however very different. The 
second line of (3.33) is just the free Schrödinger theory in disguise; 7 !, 2? are 
the real and imaginary parts of the complex Schrödinger field. The corresponding 
dispersion relation is E(p) = p*/(2M). It is interesting to note that the bilinear 
Lagrangian in the 2!,2* sector comes entirely from the term in the effective 
Lagrangian (3.24) linear in &. It is therefore insensitive to the choice of potential in 
the underlying Lagrangian (3.27). 

The above discussion has an immediate generalization to a class of models 
described by the same Lagrangian (3.27), in which ® is an n-component complex 
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field. The symmetry is then G ~ U(n). The ground state can still be chosen 
as any constant field such that (®'®) = v~/2. It is convenient to pick (®) so 
that, analogously to (3.28), only its the n-th component is nonzero and equals 
v/./2. The symmetry group G is then spontaneously broken down to the subgroup 
H ~ U(n — 1), acting on the first n — 1 components of ®. Accordingly, there are 
dim U(n) —dim U(n— 1) = 2n—1 broken symmetry generators as well as NG fields. 
The analysis of the excitation spectrum also proceeds following the same steps, 
except for the necessary complications due to the presence of unitary n x n matrices. 
At the end of the day, one finds one NG boson with dispersion relation (3.19). The 
remaining 2n — 2 NG fields pair up into n — 1 modes with the Schrédinger-like 
dispersion relation E(p) = p*/(2M). 


3.3 Moral Lessons 


In this chapter, I have refined the discussion of toy models for SSB in two ways. 
First, I have allowed for the possibility of multiple scalar fields and multiple broken 
symmetries. I have however remained within the territory of internal symmetries, 
that is symmetries only acting on the fields, independently of spacetime coordinates. 
Second, we explored the consequences of giving up Lorentz invariance, in particular 
by adding to the Lagrangian terms with a single time derivative. With the newly 
gathered experience, let me briefly revisit the four lessons drawn in Sect. 2.4. 


Lesson #1 Nothing to change. The statement made in Sect. 2.4 is generally valid. 


Lesson #2 Suppose that the given system possesses a continuous symmetry group 
G which is spontaneously broken to its subgroup H. If the action of the system is 
Lorentz-invariant, its spectrum contains dim G — dim H massless particles: the NG 
bosons. If the system is not Lorentz-invariant, NG bosons are characterized as one- 
particle excitations with dispersion relation E(p) such that limp+9 E(p) = 0. The 
low-energy dynamics of the system can still be captured in terms of dim G — dim H 
NG fields. However, the actual number of NG modes in the spectrum may be lower 
than dim G — dim H. This happens when two NG fields are canonically conjugated 
by a term with a single time derivative, leading to a single NG mode. The dispersion 
relation of such a NG mode is typically quadratic in momentum. 


Lesson #3 Nothing to change. The statement made in Sect. 2.4 is generally valid. 


Lesson #4 It is convenient to choose a field parameterization in which a set of 
constant NG fields can be eliminated by a symmetry transformation. In such a 
parameterization, every operator in the Lagrangian containing NG fields carries 
at least one derivative. If the symmetry group G is non-Abelian, it is however 
not necessary that every NG field carries a derivative. The nonlinear constraints 
imposed by the broken symmetry nevertheless ensure that the Adler zero property 
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of scattering amplitudes is preserved, modulo the exceptions alluded to in Lesson 
#3. 


Lesson #5 Nothing to change. The statement made in Sect. 2.4 is generally valid. 
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Anybody who has taken a first course on field theory has been exposed to the 
correspondence between symmetry and conservation laws via Noether’s theorem. 
Indeed, I already assumed basic familiarity with this correspondence in Part I. The 
semiclassical analysis of the toy models therein would have hardly been possible 
without the concepts of symmetry of a Lagrangian and Noether current. 

In this chapter, I will however restart the discussion of symmetries largely from 
scratch. With the rich spectrum of modern applications that emerged in the last 
decades, it appears appropriate to start by carefully defining the basic notions. I will 
not attempt to give the most general definition of symmetry. In particular, most of 
this chapter is phrased in the language of classical field theory, tailored to the needs 
of Parts III and IV of the book. Some remarks on a further generalization of the 
concept of symmetry developed here are postponed to the concluding Sect. 15.4. 
Nevertheless, within the conservative exposition offered here, I will devote more 
space to stressing exceptions rather than to repeating ad nauseam familiar concepts. 


4.1 What Is Symmetry? 


In very general terms, any definition of symmetry must include two ingredients. 
The first of these is an object that we wish to declare to be symmetric. The second 
is the operation, or transformation, that should constitute the desired symmetry. In 
classical physics, one usually defines symmetry by its action on a set of local fields. 
I will use the notation w' for a set of fields treated as functions w! : x” > w(x). 
Here x” denotes collectively a set of coordinates, which may for the time being 
include space, time, or both. The case where x” includes only time ¢ corresponds to 
mechanics with y(t) as the dynamical variables. In mathematics, it is common to 
refer to x” and y’ respectively as the independent and dependent variables. These 
(in)dependent variables may take values from some linear space or from a more 


© The Author(s) 2024 45 
T. Brauner, Effective Field Theory for Spontaneously Broken Symmetry, Lecture 
Notes in Physics 1023, https://doi.org/10.1007/978-3-031-48378-3_4 


46 4 Symmetry and Conservation Laws 


general mathematical structure such as a manifold. In the latter case, x” and yi are 
identified with the corresponding local coordinates in the sense of Appendix A.1. 


4.1.1 Symmetry Transformations 


Let us initially focus on the second ingredient of symmetry, that is the operation. 
This book revolves largely around the consequences of continuous symmetries, 
for which it is sufficient to consider infinitesimal transformations. We shall deal 
with the following generic class of simultaneous transformations of the fields and 
coordinates, 


syi (x) = y a) — yia) = € FLW, x1) , 
òx” = x" — x" = e X" [y, x](x) . (4.1) 


Here e is an infinitesimal parameter of the transformation. The square bracket 
notation indicates that Fİ and X” are local functions of the fields and (a finite 
number of) their derivatives, possibly depending explicitly on the coordinates. 
Transformations of the type (4.1) are known in mathematical literature as gener- 
alized local transformations. The terminology is historical. Namely, the class (4.1) 
generalizes so-called point transformations, first studied by Sophus Lie in the 1860s. 
For those, Fİ and X“ are only allowed to depend on fields and coordinates, not on 
field derivatives. 


Indicating that the fields and coordinates should be transformed simultane- 
ously is, while conventional, a red herring. The coordinates are independent 
variables that can be chosen at will. Any nonsingular transformation of 
coordinates can be undone by a change of variables. The content of (4.1) can 
therefore be equivalently encoded in a transformation of the fields alone, 


w(x) — yia) = € FLW, x(x) — eX", x1) dud @) . (4.2) 


This is the evolutionary form of the transformation (4.1). More generally, the 
representation (4.1) of the transformation is ambiguous with respect to the 
redefinition X“ —> X” + X” and Fİ > Fi + “ə yi, where X“[y, x] 
is any local function of the coordinates, fields and their derivatives. The 
evolutionary form can thus be viewed as fixing the ambiguity by setting 
NES] 


The class of transformations (4.1) is so broad that it is often convenient to 
consider special cases. Throughout this book, I will use the term internal symmetry 
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for a point transformation with X” = 0 and a coordinate-independent F’, that is 
Swi (x) =€F'(W(x)), 8x“=0 (internal symmetry) . (4.3) 


A point transformation for which X” is nonzero but only depends on the spacetime 
coordinates will be referred to as a spacetime symmetry, 


Swi(x) =F (W(x), x), sx’ =eX“(x) 40 (spacetime symmetry) . 
(4.4) 


These definitions of internal and spacetime symmetries define the agenda for 
Parts III and IV of the book. The reader should however be warned that these are 
not standard definitions aligned with published literature. Of all the reasons, this is 
because a precise definition of internal and spacetime symmetries is rarely found. 


A spacetime translation in the direction of coordinate x” is most naturally imple- 
mented by setting Sywi (x) = Oand 8,x" = 6d’. According to (4.2), this can be 
equivalently encoded as Swi (x) = —€0, yi (x) without any change of coordinates. 
Translations are an example of a “purely spacetime” transformation, illustrated in 
the right panel of Fig. 4.1. As the figure shows, one can intuitively think of internal 
and spacetime symmetries as point transformations, generating flows along different 
subspaces of the space of fields and coordinates. 


———————— 


x x 


Fig. 4.1 Schematic illustration of internal (left panel) and spacetime (right panel) symmetries. 
The oriented curves indicate the flow in the space of fields and coordinates (see Appendix A.3), 
generated respectively by the infinitesimal transformations (4.3) and (4.4). To stress the difference 
between the two cases, the right panel shows a “purely spacetime” transformation, for which F! = 


0 


48 4 Symmetry and Conservation Laws 


4.1.2 Object of Symmetry 


I will assume throughout the book that the set of all symmetry transformations, 
generated by the infinitesimal motions (4.1), constitutes a group. Historically, the 
mathematical field of Lie groups arose from the study of symmetries of differential 
equations. This is now a mature subject with a large body of literature devoted to 
it. An interested reader is referred to [1-3] for more details. Very briefly, the major 
applications of symmetry methods to differential equations include: 


e Finding new solutions of a given differential equation from already known ones. 
e Finding solutions respecting the symmetry of the differential equation. 
e Classifying differential equations with given symmetry. 


In physics, it is much more common to define symmetry by invariance of the action 
of a system under some transformation. It is worth stressing that the two notions of 
symmetry are not equivalent. On the one hand, there are differential equations that 
do not originate from any variational principle, yet may have nontrivial symmetries. 
On the other hand, differential equations that do descend from an action functional 
may have a larger symmetry group than the action itself. 

The chief advantage of defining symmetry through an action functional is that 
this gives us a direct link to conservation laws. The link is supplied by the celebrated 
Noether theorem, which is the subject of most of Sect.4.2. It is in principle also 
possible to define conservation laws directly on the level of a differential equation. 
This allows one to deduce a restricted correspondence between symmetries and 
conservation laws that generalizes the Noether theorem [3]. However, as far as I 
know, such a generalized notion of symmetry is of limited use in physics, and I will 
therefore not pursue it any further. 


4.2 Lagrangian Approach to Symmetry 


The correspondence between continuous symmetries and conservation laws has 
a fascinating history. A nice overview of the developments following Noether’s 
groundbreaking discovery, including an account of the contributions of various 
authors to the subject, can be found in [4]. Noether’s theorem is covered to some 
extent in virtually any textbook on classical mechanics or (classical or quantum) 
field theory. Unfortunately, this is frequently done under unnecessary restrictions 
on the Lagrangian or on the type of symmetry transformation. Here I will present a 
fairly general version of Noether’s theorem following a trick, which to the best of 
my knowledge goes back to Gell-Mann and Lévy [5]. 
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4.2.1 Noether’s Theorem 


I consider a class of field theories defined by a Lagrangian density , which is a 
local function of a set of fields y’ and their derivatives, 


S= jars Liy, x(x). (4.5) 


Here D is the dimension of the space of independent variables, which may still 
include space, time or both. The Lagrangian density may depend explicitly on the 
coordinates, and there is no restriction on the order of field derivatives it contains. 
Suppose that the action S is invariant under the infinitesimal transformation (4.1).! 
Then there is a local vector field J”[y, x], called Noether current, which is 
divergence-less, 


ð J iy, x]=0  (on-shell) . (4.6) 


The qualifier on-shell reminds us that the conservation law (4.6) only holds for fields 
satisfying the Euler-Lagrange equation of motion (EoM). 

To prove this claim, we evaluate the variation of the action 5S under a modifi- 
cation of (4.1) where the parameter €(x) is allowed to depend on the coordinates. 
With the assumption of invariance under the original transformation (4.1), 8S may 
only depend on derivatives of e (x). Using integration by parts, it can then always be 
brought to the form 


èS = f dx J” Iy, x](x)d,e(x)  (off-shell) . (4.7) 


Here the qualifier off-shell indicates that I have not yet imposed the EoM on the 
fields. We can see that the Noether current can be extracted as the coefficient of 
ð € in the variation of the action. Once the EoM is applied, 8S must vanish by 
the definition of the variational principle. This guarantees the on-shell conservation 
law (4.6). 


Example 4.2 


For a simple example, consider a free massless relativistic scalar field @ with the 
Lagrangian density Z [p] = (1/ 2) (8n p). This is obviously invariant under a shift 
of the field, p — +e. Making the shift coordinate-dependent, we find the variation 
of the action 6S = f dPx Ə” p (x)ð E(x). Comparison with (4.7) tells us that the 
Noether current is J“[@] = a“. Conservation of this current is equivalent to the 


1 When evaluating the variation of the action under (4.1), it is allowed to drop whatever boundary 
terms might appear. Thus, the “invariance of the action” should be more accurately interpreted as 
invariance of the Lagrangian density up to the divergence of a vector field. 
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EoM for ġ, which is the massless Klein—Gordon equation. As an aside, such an 
equivalence holds for any theory of a real scalar ¢, invariant under the shift 6 > 
$ + €. This follows immediately from the variation of the action under the localized 
transformation d(x) > (x) + €(x), 


ès 
èS = fa aq) = - f ax In J [b, xæ) x) , (4.8) 


which implies that 0, J” [ġ, x](x) = —8S/ð¢ (x) off-shell. 


The statement of Noether’s theorem is well-known. Instead of collecting numer- 
ous examples, I will thus focus on a few comments and illustrations that go 
somewhat off the beaten track. To start with, one should keep in mind that 
the Noether current is not uniquely determined by the assumed symmetry. The 
variation (4.7) only allows us to extract J“[w, x] up to addition of a vector field 
whose divergence vanishes off-shell. Such a modification of the current does not 
affect the conservation law (4.6). 

There is another, somewhat more subtle ambiguity in the Noether current, related 
to the definition of the transformation used to produce (4.7). Namely, it would be 
tempting to simply take (4.1) and make it local by replacing € — e(x). But there 
is a more general possibility that the localized transformation also depends on the 
derivatives of €(x). All that is required is that for constant €, the transformation 
reduces to (4.1). To see how this ambiguity affects the Noether current, consider a 
local transformation of the form 


Sw (x) = ex) F iy, x10) + One (x) KL, x] (x) , (4.9) 


where K‘“[y, x] is an arbitrary local function of the fields and their derivatives. To 
keep things simple, I have used the evolutionary form of the transformation where 
the coordinates x” do not change. Applying (4.7), one finds that the new term in the 
transformation rule for y’ shifts the Noether current by a term linear in K'“[w, x], 


8S 
Byi (x) © 


JPY, x] Q@) = IML W, x] () tar K'" Tw, x](x) (4.10) 


The argument is easily generalized to transformations depending on arbitrarily high 
derivatives of €(x). All the ensuing corrections to the current are proportional to 
85/8y', which defines the EoM for the fields. The moral is that the ambiguity in the 
definition of the localized symmetry transformation leads to new contributions to 
the Noether current that vanish on-shell [6]. Neither this ambiguity does therefore 
affect the conservation law (4.6). 
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Example 4.3 


By the naive replacement € — €(x), we can localize the action of spacetime 
translation in the x’-direction as 8ywi(x) = —e(x)d,wi (x). Assuming that the 
Lagrangian density does not depend explicitly on the coordinates and on higher 
than first derivatives of w', (4.7) gives us the set of currents 


OL 


TH, =L — 
apy") 


dayi. (4.11) 


This is the familiar canonical energy-momentum (EM) tensor. (The unusual overall 
sign is a consequence of the conventions used here.) This EM tensor is known to 
possess some undesired features that have inspired various “improvements” with a 
history as long as Noether’s theorem itself. For an illustration, consider a relativistic 
theory of a scalar field @ and a vector field A,,, defined by 


Zip, A] = A” d,g — 5AM Ay (4.12) 


The canonical EM tensor of this theory is T“” = g”. Z — Ad"; I have raised 
the second index with the flat Minkowski metric g“”. This EM tensor is notably not 
symmetric, which is a general trait shared by theories that contain nonscalar fields. 
This is quite troubling, if only for the fact that (4.12) is actually the free massless 
scalar theory in disguise. Indeed, using the EoM for the vector field, A, = d,@, 
turns the Lagrangian into 7[¢] = (1/2) (ðu). 

One way to solve this problem is to note that the naive local translation, 
õp Au (x) = —€(x)d, A, (x), is not compatible with the EoM for A,,. Under a local 
translation, A, should transform as a covariant vector field. Let us therefore try 


vh (x) = =E) AGO), SyAp (x) = =E (x) Au x) — Ar) ILE) . 
(4.13) 


This is a generalized local transformation of the type (4.9), which leads to the 
correspondingly modified EM tensor, TH” = gH” Z + AH A” — (AH 3” + A” AKO). 
This is symmetric off-shell, which is ultimately because the transformation (4.13) 
has a well-defined geometric meaning [6]. In the language of Appendix A.3, (4.13) 
represents the Lie derivative of ¢ and A,, along the vector field —e(x)0,. 


The above example shows that the ambiguity in the definition of the local 
transformation used to produce the Noether current is not just a nuisance. It may 
be used as a tool to construct “improved” Noether currents with desired properties. 
Quite recently, this idea was exploited to systematically construct EM tensors 
tailored to Lorentz, scale and conformal symmetry [7]. For further background 
and references on Noether’s theorem and improvement of Noether currents, see for 
instance [8]. 


52 4 Symmetry and Conservation Laws 


4.2.2 Tensor Conservation Laws 

Before closing the discussion of Noether’s theorem, let me stress that (4.6), while 
generic, is not the only form the ensuing conservation law may take. It may happen 
that the variation of the action (4.7), or its parts, only depends on higher than first 


derivatives of €(x). This leads to conservation laws with likewise higher than first 
derivatives of a generalized tensor current. 


Consider the theory of a free real Lifshitz scalar field @ [9], 


1 1 
Zip] = 5 (09) - 5 rds)” (4.14) 


This Lagrangian is, just like that of the massless relativistic scalar theory, invariant 
under the shift ¢ — p + €. Making the replacement € — €(x) leads to the variation 


8S = f dx {J'e E) — I E)E}, (4.15) 


where J’[ġ] = 3o% and J” [p] = 4,4,@. The corresponding on-shell conservation 
law takes the form 


oJ lA] + drðs Jl] = 0. (4.16) 

This is, unsurprisingly, equivalent to the EoM for @. 
From now on I will always assume that the coordinates x” include both space and 
time. We can then integrate the local conservation law (4.6) over space. Assuming 


asymptotic behavior of the fields at infinity such that the boundary term produced 
by integrating 0,J”[w, x] vanishes, we conclude that the integral charge 


Q= fes JÌ iy, x(x) (4.17) 


is time-independent. Curiously, a local conservation law of the type (4.16) is 
stronger. Namely, apart from conservation of Q, it also implies time-independence 
of 


Q' = f is Pwo. (4.18) 


This looks like the dipole moment of the charge distribution defined by the density 
J°[y, x]. Conservation laws of the dipole type (4.16) have recently attracted 
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considerable attention due to their relevance for so-called fracton phases of matter. 
The reader is referred to [10, 11] for more details about this intriguing subject. 


4.3 Symmetry and Conservation Laws 
in Hamiltonian Formalism 


The Lagrangian formalism provides a natural framework for the discussion of 
symmetries and conservation laws, yet it also offers numerous other benefits. 
For applications to effective field theory (EFT), it is particularly important that 
the formalism works without change for Lagrangians depending on higher field 
derivatives. Also, it makes the transition from classical to quantum theory straight- 
forward within the path-integral approach to quantization. However, we will see 
in Chap. 5 that the most striking manifestations of spontaneous symmetry breaking 
(SSB) include peculiar properties of the quantum ground state and the spectrum of 
excitations above it. Such features are not easily addressed using path integrals; here 
it is more natural to use the operator approach to quantization. 

With this in mind, I will now devote some space to the Hamiltonian formalism, 
which is the classical counterpart of the operator language of quantum field theory. 
This comes at the cost of having to restrict the discussion to theories whose action 
functional depends only on the first time derivatives of fields. In return, we gain the 
reverse of Noether’s theorem, allowing us to extract the corresponding symmetry 
from a given conservation law. What follows is a brief survey of the symplectic 
formulation of the Hamiltonian formalism, adapted to local field theory. The 
background developed here will prove useful in Chap. 8. For a further generalization 
of the approach outlined below, see for instance [12]. A reader interested in modern 
developments of this approach is advised to consult [13]. 


4.3.1 Symplectic Formulation of Hamiltonian Dynamics 


The starting point is a manifold called the target space of the given theory. This 
manifold carries a geometric structure fixed by a (locally defined) 1-form w called 
the symplectic potential. I will use the notation œw = «;(&)dé' where é! is a set of 
(local) coordinates on the target space. The phase space of the theory consists of 
all time-independent fields taking values in the target space. With some abuse of 
notation, I will denote such fields as ¿i (x). The Hamiltonian of the theory is a local 
functional on the phase space, H = f dx #/[£, x](x). As indicated by the square 
bracket notation, the Hamiltonian density # is a local function of the fields and 
their spatial derivatives, possibly also depending explicitly on spatial coordinates. 
The action is now a functional of trajectories on the phase space, which I with 
some further abuse of notation denote as ¿i (x,t) = ¿i (x). It is fixed by the choice 
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of symplectic potential and Hamiltonian, 


S= / dx {a EDE (x) — HIE, ¥](x)} . (4.19) 


The EoM for the variational principle based on (4.19) takes the form 


dwj E) dei) 


is 8H 
U E@)N)E! X) = Sa 2j (€) = ag! aé/ 


SEFC) (4.20) 
Here §/8é! (x) indicates taking the variational derivative of a functional on the phase 
space with respect to £' (x), and substituting the trajectory £ (x) in the result. The 
antisymmetric matrix {2;;(€) collects the components of the symplectic 2-form, Q = 
(1/2); (E )dé! A dé/ . This is an object of central importance for the Hamiltonian 
approach to mechanics and field theory. In the language of Appendix A.8, it is a 
closed 2-form, for it is related to the symplectic potential via Q = dw. Moreover, 
the matrix {2;;(€) is assumed to be nonsingular so that (4.20) constitutes a complete 
set of evolution equations for the fields £. One can then rewrite (4.20) as 


SH 
SEI (x) | 


E(x) = QU (E(x) (4.21) 


where Q’ (E) is the matrix inverse of 2; ;(&). 


Consider a Lagrangian field theory of n scalar fields ¢', taking values in R”. The 
target space of the Hamiltonian description of this theory is R” x R”, spanned on 
the fields ¢' and their conjugate momenta z;. The Lagrangian and Hamiltonian 
densities are related by the Legendre transform, Z = 2;¢' — #. Matching this 
to (4.19) allows one to identify the symplectic potential, w(@, 7) = ;dd!'. The 
symplectic 2-form in turn becomes Q(@,7) = dz; A dø’, and the EoM (4.20) 
reduces to the familiar form 


8H ; 5H 
—— (x) =-=. 
òmi (x) 5d! (x) 


! (x) = (4.22) 


In general, coordinates ¢', 7; on the target space in which the symplectic 2-form 
acquires the simple form Q(¢, 7) = dr; A dd! are called Darboux coordinates. 
By the Darboux theorem, such coordinates exist at least locally on any manifold 
endowed with a symplectic 2-form (see for instance Sect. 43 of [14]). The global 
existence of Darboux coordinates on the target space is however not guaranteed. 
In fact, it is ruled out whenever the symplectic 2-form is closed but not exact. 
Example 4.7 below provides a nontrivial illustration of this possibility. The geome- 
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try of manifolds carrying a symplectic structure is therefore locally identical to that 
of R” x R”, but may be globally nontrivial. 


With the symplectic 2-form at hand, one can define the Poisson bracket of any 
two local functionals F, G on the phase space, 


SF 8G 
SEI (x) SEI (x) ` 


{F,G}= f dx QÏ (E(x)) (4.23) 


A special case is the fundamental Poisson bracket of the phase space coordinates, 


Eia), E/ (y)} = QY (E(x) E — y). (4.24) 


In local Darboux coordinates, (4.23) reduces to the textbook definition of Poisson 
bracket in terms of derivatives with respect to canonical coordinates wi (x) 
and momenta z;(x). Likewise, (4.24) generalizes the fundamental brackets 
(W(x), WO) = (ix), o) = 0 and {yi æ), rjo) = 8454 — y) in 
a way independent of the choice of coordinates on the target space. It is easy to 
check that (4.23) has the following properties. First, {F, G} is obviously linear in 
both arguments and antisymmetric. Moreover, it satisfies the Leibniz (product) rule. 
Finally, it satisfies the Jacobi identity; this takes more effort to prove and requires 
using the closedness of the symplectic 2-form. 

The Poisson bracket offers a very compact expression for time evolution. For any 
local functional F on the phase space that does not depend explicitly on time, 


F= fate al ti (x) = {F, H}, (4.25) 
8&! (x) 


where I used (4.21). The latter is itself equivalent to ¿i = {&', H}. In this form, it is 
natural to think of time evolution as a flow on the phase space, generated by H. 

This completes the preparation required to address the main objective of this 
survey of the Hamiltonian formalism: the connection of symmetries and conser- 
vation laws. First, everything said in Sect. 4.2 about Noether’s theorem remains 
valid in the present Hamiltonian setting. For the sake of identifying the Noether 
current, there is no difference between the action functionals (4.5) and (4.19). We 
can therefore conclude at once that a continuous symmetry of the action (4.19) 
implies the existence of a current J“[&, x] conserved on-shell. This current can 
be identified by localizing the symmetry transformation and using (4.7). 
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The Hamiltonian density of a free massless relativistic scalar field ¢ is #[@, 7] = 
(1/ 2r? + (1 / 2)(Vo)?. The symplectic potential takes the Darboux form, 
w(¢, m) = md¢@. The action functional 


S= f dx frwd) - Tro? = sve? (4.26) 


is invariant under the constant shift @ —> @+ € with z kept unchanged. Making the 
shift coordinate-dependent, the action varies by 8S = f dx [r (x)ð3oe (x) — Vo(x)- 
Ve(x)]. This leads to the identification of the temporal and spatial components of 
the Noether current as J°[¢, n] = 2, J’[¢, 1] = —3-p = +0". Upon using the 
EoM for @, ġ = 7, the current is seen to coincide with that derived in Example 4.2, 


J [p] = 0" 6. 


What is new in the Hamiltonian framework is that we can now reverse Noether’s 
theorem and reconstruct the symmetry from a given conservation law. By (4.25), 
a local functional F that does not depend explicitly on time defines an integral of 
motion (conserved charge) if and only if {F, H} = 0. Moreover, if F and G are 
both conserved, then so is {F, G} thanks to the Jacobi identity. Thus, all conserved 
charges of the theory furnish a Lie algebra with respect to the Poisson bracket. 
Finally, every conserved charge generates a flow on the phase space, defined by the 
infinitesimal transformation 


ôF 


BE (x) = elg (E), F} = QUE) ETG - 


(4.27) 


We thus have an explicit realization of a geometric transformation, associated with 
a given conserved charge. It remains to demonstrate that this is in fact a symmetry 
of the action (4.19). To that end, a short calculation using the definition of the 
symplectic 2-form shows that under the transformation (4.27), the action varies by 


èS = -e fas oF a)-e f arth, F(t). (4.28) 
SE! (x) 


The first term vanishes upon integration, being a total time derivative. The second 
term vanishes by the assumption that F is conserved. This completes the proof 
that every conserved charge (that does not depend explicitly on time) generates a 
symmetry of the action. 
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The low-energy physics of ferromagnets can be encoded in a continuous field theory 
whose basic degree of freedom is the local spin (magnetization) density. In the 
first approximation, fluctuations of the ferromagnetic equilibrium correspond to 
local changes in spin orientation, while the magnitude M of spin density remains 
constant. The target space is thus equivalent to the sphere S*. The local spin variable 
n'(x) is a unit three-vector field that satisfies the fundamental Poisson bracket 


' ; lo 
{n'(x), n O) = u 1 næ a — y), (4.29) 


reflecting the Lie algebra of angular momentum. This is a prominent example of a 
system with a phase space whose symplectic structure is largely fixed by symmetry 
alone. Other examples can be found in [15]. 

For any Hamiltonian that is a local functional of n’, the ensuing EoM is ñ! = 
{n', H}. Using the chain rule, this can be put into a neat vector form? 


1 8H 


WO) = FF Ge) 


x n(x), (4.30) 


known as the Landau-Lifshitz equation. The concrete form of the Hamiltonian 
depends on the physical system. For isotropic ferromagnets, the simplest choice 
is quadratic in gradients of n(x), as suggested by the Landau theory of phase 
transitions, 


H= [os 5 ou Vn (x) -Vni (x). (4.31) 


Here os is the so-called spin stiffness. The EoM then reduces to 


AG) = Sna) x V2n(x) . (4.32) 


Matching (4.30) to the general EoM (4.20) allows one to extract the symplectic 
2-form on S?, corresponding to the Poisson bracket (4.29), 


M tasj k 
Q= =F En dn! ^ dn“ . (4.33) 


This is, up to normalization, the area 2-form on S2, see Example A.7 in 
Appendix A.3. The area 2-form is closed but not exact. Hence the sphere $? is 
an example of a symplectic manifold on which globally well-defined Darboux 


? This is a slight abuse of notation. I am using boldface italics both for spatial vectors living in R? 
with arbitrary d and for the spin vector living on S? C R?. 
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coordinates do not exist. This fact has deep consequences for the topological 
properties of ferromagnets. As to symmetry, the symplectic 2-form (4.33) and 
the Hamiltonian (4.31) are both manifestly invariant under SO(3) rotations of the 
spin variable n'. This symmetry corresponds to the conservation of total spin, 
S = M f d% n(x). 

Ferromagnets are fascinating materials that exhibit many of the nontrivial 
features of SSB. This example is just a taster; in Sect. 9.2, I will serve the reader 
a much more thorough discussion of the low-energy physics of spin systems. 


4.3.2 Symmetry in Quantum Physics 


Our discussion of symmetries in field theory has been strictly classical so far. 
The implementation of symmetries in quantum field theory comes with numerous 
subtleties. Some of these are related to SSB and I will return to them in the next 
chapter. Here I will therefore just briefly outline the transition from classical to 
quantum physics. As is well known, this transition is streamlined in the Hamiltonian 
formalism. Local functionals on the phase space F,G,... are replaced with 
operators F,G,... on the Hilbert space of physical states. The commutator of these 
operators is then obtained from the Poisson bracket of their classical counterparts, 
roughly speaking, by the replacement {F, G} —> —i[Ê, G]. In the following, 
I will drop the hat on operators, but otherwise closely follow the Hamiltonian 
representation of symmetries outlined above. For the sake of simplicity, I will only 
consider conserved charges that do not explicitly depend on time. 

With this qualification, a Hermitian operator Q represents a conserved charge of 
a quantum system if and only if it commutes with the Hamiltonian, [Q, H] = 0. 
By analogy with (4.27), the conserved charge generates a flow on the algebra of 
observables. For a given Hermitian operator A, the shift induced by Q is 


8A = —ie[A, Q]. (4.34) 


This can be extended to a transformation with a finite parameter €. One can think of 
such a transformation as a map A — A(€) where A(e) satisfies the “flow equation” 


dA(e) . , 
d ~ —i[A(e), Q] = i[Q, A(€)] . (4.35) 
This has the formal solution 
A(e) = L2 A(O)e 2 | (4.36) 


The duality between the Schrödinger and Heisenberg pictures of quantum mechan- 
ics suggests an interpretation of (4.36) in terms of a formal unitary operator 
U(e) = e€2, acting on the Hilbert space of states. Instead of transforming 
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operators via (4.36), we could then equivalently transform physical states by U (e€). 


In either case, the fact that Q is a conserved charge is reflected by the invariance 
of the Hamiltonian under the transformation generated by Q, H(e) = H or 
[U (€), H] =0. 


As I will demonstrate in Sect. 5.3, when the symmetry in question is sponta- 
neously broken, the operator U (€) may in fact not exist. This is one of the 
quirks of SSB. The symmetry may not be realized by unitary operators on the 
Hilbert space. As we will see, this feature is related to the nontrivial structure 
of the Hilbert space in presence of spontaneously broken symmetries. Even 
then, it makes sense to ask how the symmetry affects results of measurements. 
The transformation of physical observables under the symmetry remains well- 
defined and is still expressed by the flow equation (4.35). 


The starting point of the discussion in this chapter was that a definition of 


symmetry requires a transformation and an object. All the objects we have worked 
with so far—actions, Hamiltonians, and EoM—capture the dynamics of the entire 
physical system. It is however no less interesting and useful to study the symmetries 
of a particular state of the system, whether classical or quantum. This brings us to 
the realm of SSB, which will be addressed in detail in the next chapter. 
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A 
| Check for | 


updates 


In this chapter, I will finally spell out a formal definition of spontaneous symmetry 
breaking (SSB). While the previous chapter was largely phrased in terms of classical 
field theory, I will now switch entirely to the quantum-theoretic language. This 
is more than justified by Sect.5.3, which gives the reader a flavor of the many 
subtleties, associated with SSB in quantum systems. Some further details on the 
physical aspects of SSB can be found for instance in the recent lecture notes [1]. 
A mathematically oriented reader will find the classic book [2] a unique source of 
additional information. 

To motivate what comes below, let me briefly return to the roots of Lie group 
theory in the study of symmetries of differential equations. Suppose that we 
have a set of (partial) differential equations invariant under some group G of 
transformations. A given solution to the equations can fall into one of two classes: 


(i) The solution is not invariant under G. This is the generic case, in which one can 
use the symmetry to obtain new solutions by the action of transformations from 
G on the solution already known. 

(ii) The solution is invariant under G. Such solutions are special, and are usually 
easier to find than a general solution to the same equations. The reason for this 
is that assuming a priori G-invariance of the solution reduces the number of 
independent variables of the differential equations. 


The situation in quantum physics maps closely to what one does in the context of 
the theory of differential equations. The major difference is that one tends to focus 
on the eigenstates of the quantum Hamiltonian as the primary tool to investigate 
physical observables. For this reason, it is common to start with the ground state, or 
at least a metastable equilibrium state, and study the excitations above it. This is the 
quantum counterpart of the theory of small oscillations in classical mechanics [3]. 
We will see that quantum physics offers natural analogs to both of the above- 
mentioned cases (i) and (ii) one encounters in differential equations. In both cases, 
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symmetry has profound consequences for the structure of the excitation spectrum. 
For most of this chapter, I will focus on the symmetry properties of the equilibrium 
state. I will return to the excitation spectrum in Sect. 5.3, and then again in detail in 
Chap. 6. 


5.1 Physical State and Its Symmetry 


A proper discussion of SSB requires a number of new concepts. In order to make the 
narrative as natural as possible, I will augment it with some simple examples from 
condensed-matter and statistical physics. This will highlight the close connection 
between SSB and phase transitions. To parallel the language of statistical mechanics, 
I will represent the state of a quantum system with a density operator (or density 
matrix) @. Readers who need to refresh their memory of the density operator 
formalism will find an undergraduate-level introduction for instance in [4—6]. 

We already know that symmetries of quantum systems are represented, if only 
formally, by unitary operators. An operator U is said to constitute a symmetry of the 
(pure or mixed) state o if the density operators ọ and U@U" are indistinguishable 
by any measurement. The latter means equal probabilities for any specific outcome 
of any measurement, and by extension equal averages of all observables. Insofar as 
our description of the state of the system is free of redundancies, this implies that 
the density operators must be equal, 


UoU i o (symmetry of a state) . (5.1) 


Suppose that g is a pure state, that is @ = |W) (y| for some normalized ket-vector 
ly) in the Hilbert space. Then a unitary operator U represents a symmetry of ọ 
if and only if |y) is an eigenstate of U. One side of the equivalence is obvious: 
if |y) is an eigenstate of U, then UQU' = Q. To prove the opposite implication, 
write |(y| U |y)? as (YI U |W) (Yl UY |W) = (YI U@U" |W) = 1. Thus, the states 
|y) and U |W) are both normalized to unity and have a unit overlap, which is only 
possible if they are equal up to a phase. It follows that |y) is an eigenstate of U. 

In the following, I will frequently use a shorthand notation for the average of an 
observable A in the state o, 


(A)o = tr(0A) . (5.2) 


It is a simple corollary of (5.1) that a symmetry transformation of observables, 
A — U'AU, does not affect their average in symmetric states. To see this, 
write (U'AU), as tr(gU' AU) = tr(UoU' A) = tr(@A), which is just (A). This 
conclusion holds for discrete and continuous symmetries alike. 


5.1 Physical State and Its Symmetry 63 


5.1.1 Broken and Unbroken Symmetry 


Suppose now that the Hamiltonian of a system is invariant under a set of trans- 
formations that span a group G. These are represented, even if just formally, by 
a set of unitary operators U(g), g € G. A randomly chosen state ọ will not be 
symmetric under all of these. Those transformations that are symmetries of ọ will 
span a subgroup of G called the unbroken subgroup of ọ, denoted by 


Ho = {g € G|U(g)oU(g)* = 0}. (5.3) 


All other elements of G are said to be spontaneously broken in the state 9. This is 
our formal definition of SSB. 


Example 5.1 


Pure spin states of a spin-1/2 particle comprise a two-dimensional Hilbert space 

isomorphic to C7. This Hilbert space carries a representation of the spin group, 

G œ SU(2), generated by the spin operator S. Consider the projection of spin to a 

direction, defined by the unit vector n = (sin @ cos g, sin @ sin Q, cos 0), 

h h ( cos@ e sing 

-S=<n-t=-<|{. ; 5.4 
i 20T ar — cos 0 ) G4 

Here 0, ¢ are the standard spherical angles and q is the vector of Pauli matrices. The 

projected spin operator has an eigenvector |n, +) with eigenvalue +4/2, 


h cos 6/2 
n-S |n, +) >= 2 ln, +) ’ In, +) = (4s a : (5.5) 


Any nonzero vector in C? can, up to an overall factor, be cast in this form with 
a suitable choice of 6, œ. Hence, any nonzero vector in this Hilbert space is an 
eigenstate of a projection of the spin operator to some direction. We conclude that 
for any pure state of a spin-1/2 particle, the unbroken subgroup is isomorphic to 
U(1), and consists of spin rotations generated by n - S. 


How do we check whether a given symmetry is spontaneously broken in a given 
state o? For the density operators ọ and U@U" to be different, there must be at least 
one observable A whose average in the two states differs. This observable must itself 
not be invariant under the symmetry, as U'AU = A would imply 


(A)yput = t(UQU'A) = tr(QU'AU) = tr(@A) = (A)o . (5.6) 
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In order to detect SSB, one must therefore use an observable that itself breaks 
the symmetry. This fairly simple observation forms a conceptual foundation of the 
approach to SSB developed in Sect. 5.2. 

One observable that is by definition invariant under all the operators U(g) 
from the symmetry group is the Hamiltonian itself. Hence, for those g € G that 
are spontaneously broken, the state U(g)@U(g)' differs from ọ but has the same 
average energy. This presents a serious conundrum. Suppose that ọ is a purported 
ground state of a quantum system. From quantum mechanics, we are used to 
thinking about the ground state as a unique vector in the Hilbert space. But now 
U(g)QU(g)' for any g ¢ Ho is a different state of the same energy. How do we 
know which of the different states having the same energy to choose as the ground 
state? 

We noticed already in Chap.2 that SSB implies the existence of degenerate 
ground states. This property is so universal that it earned the privilege of being my 
first “moral lesson” in Sect. 2.4. But now our problem is even worse. In the classical 
analysis of Chap. 2, the various candidate ground states could be uniquely labeled 
by the expectation value of a scalar field. In the present fully quantum setting, we 
have not only the set of density operators U(g)oU(g)' with g € G. We can even 
make linear combinations of these states. In the desire to identify a unique ground 
state, we might wish to take a “democratic average” of all the would-be ground 
states, connected by transformations from G. This is an idea that certainly deserves 
at least consideration, even if it will eventually turn out to be physically incorrect. 
However, the related mathematical procedure of averaging over different elements 
of the symmetry group will actually prove quite useful. 


5.1.2 Symmetrization by Group Averaging 


When performing an average over the symmetry group, I will repeatedly make use 
of a simple statement that I will formulate as a standalone lemma. Let G be a finite 
group or a compact Lie group, and U(g) be its unitary representation on a finite- 
dimensional vector space. Then 


1 
P= —— U 5.7 
ree (g) (5.7) 


geG 


is a projector to the subspace of vectors invariant under G. In particular, if the 
representation of G by U(g) does not contain in its decomposition any singlet 
(that is, a trivial one-dimensional representation of G), then the action of P on any 
vector gives zero. The proof of the lemma is very simple and I will therefore leave 
it up to the reader. Let me just add a comment on the sum over group elements. 
Equation (5.7) as it stands is only valid for finite groups; |G| then denotes the 
number of group elements. For compact Lie groups, the sum has to be replaced by 
group integration with an invariant measure on G; |G| then stands for the volume 
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of G with respect to this measure. An interested reader will find more details in 
Chap. 3 of [7]. In the following, I will pragmatically use a sum over g € G as 
in (5.7) without repeating this disclaimer. 

Let us now get back to density operators. The action of the group G on these 
is defined by g : ọ —> U(g)QU(g)". This suggests that we consider the following 
statistical average, 


1 i 
b= — P_U(g)QU(g)' . (5.8) 


IGI oz 


This average still satisfies the requirements on a density operator: it is Hermitian, 
positive-semidefinite and has a unit trace. Moreover, it is manifestly invariant under 
all transformations from G. It therefore represents a (mixed) physical state of the 
system which is perfectly symmetric under the whole group G, yet has the same 
average energy as Q. 


In case the original state is pure, 9 = |Y) (Y|, we have the alternative choice 
to symmetrize the ket-vector |y) rather than the density operator, 


1 


P = 
Iv) > P |Y) IGI 


X uei). (5.9) 


geG 


This averaged ket-vector, if well-defined, represents a pure state of the same 
energy as |y) that is symmetric under the whole group G. The catch is that 
the ket-vector may be ill-defined. By our symmetrization lemma, this happens 
whenever |y) belongs to a representation of G that does not contain any 
singlet. Recall, for instance, Example 5.1. There, the whole Hilbert space C? 
constitutes an irreducible representation of the spin group G ~ SU(2). The 
naive average (5.9) therefore gives zero no matter which pure state we start 
from. On the other hand, the average density operator (5.8) must describe 
some physical state of the spin-1/2 particle. By Schur’s lemma, ọ must be 
proportional to the identity operator on C*. The requirement that tr@ = 1 
then fixes the normalization so that 9 = (1/2) id, for any choice of the initial 
state o. This mixed state describes an unpolarized distribution on the Hilbert 
space. 


It seems that we managed to have our cake and eat it. By the simple averaging 
procedure, we can make any state G-invariant without changing its energy. Have we 
done away with the whole notion of SSB? To see that the group averaging is not so 
innocuous, we need to work out the consequences. Consider a set of observables A! 
such that under the action of the operators U(g), they transform by some (real and 
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orthogonal) matrix representation R of G, 
UŻA Ul) =R(g)', Al,  geG. (5.10) 


Suppose that all these observables and their nontrivial linear combinations break 
(part of) the symmetry under G. In other words, the representation R does 
not contain any singlet of G in its decomposition. It then follows as a simple 
consequence of the definition of the density operator (5.8) and the symmetrization 
lemma that 


1 : 
(A')a = r(A’) = it Yo [Ugo (g)A'] 


== Y tr[oU(g)'A'U(g)] = T L OREA =0. 61D 
gEG gEG 
We could have anticipated this: in the G-invariant state 0, the average of any 
observable breaking the symmetry vanishes. 
Why should this be bad? Any quantum system possesses a distinguished state 
invariant under its symmetry group G: the thermodynamic equilibrium. This is 
represented by the canonical density operator 


_ | -pH 
Qp = ze ; (5.12) 
where is the inverse temperature and Z = trexp(—ß H) the canonical partition 
function. The claim therefore is that in thermodynamic equilibrium, any observable 
breaking the symmetry of the system must average to zero. There cannot be any SSB 
in thermodynamic equilibrium. This would be very pretty, were it not in a blatant 
contradiction with empirical evidence. 


Example 5.2 


Solid ferromagnets can be visualized in terms of a lattice of mutually aligned 
spins (see the left panel of Fig.5.1). The ferromagnetic state obviously breaks 
rotational symmetry. Moreover, the presence of the crystal lattice also breaks spatial 
translations. Both rotations and translations are fundamental symmetries that are 
expected to be preserved by the microscopic dynamics of any material. Hence 
ferromagnetism certainly is a macroscopic manifestation of SSB. 

In nematic liquid crystals (right panel of Fig. 5.1), long organic molecules are 
aligned along the same direction. However, the positions of the molecules are 
random as in ordinary liquids. As far as symmetry is concerned, the molecules can 
be represented by (undirected) rigid rods or ellipsoids with randomized positions. 
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ferromagnet nematic 


Fig. 5.1 Comparison of the symmetry-breaking states of ferromagnets and nematic liquid crys- 
tals. The symmetry of the former corresponds to that of a regular lattice of oriented, mutually 
aligned spins. Nematics, on the other hand, can be represented by mutually aligned undirected 
rods with random positions 


Hence the nematic state breaks rotational symmetry but not spatial translations. See 
Sect. 2.7 of [8] for basic phenomenology of liquid crystals. 


One of the greatest accomplishments of physics is the discovery that the 
fundamental laws of nature possess a high degree of symmetry. How is it then 
possible at all that we observe so many manifestly asymmetric macroscopic states 
of matter? There seem to be two logical possibilities to avoid a contradiction: 


e The observed asymmetric states are not in thermodynamic equilibrium, and thus 
may be at most metastable. 

e The asymmetry of the observed states is a result of perturbations breaking the 
symmetry of the microscopic interactions. 


Both of these possibilities are relevant. In order to understand their role, we have to 
dive deeper into the thermodynamics of physical systems with SSB. In the next 
section, I will focus solely on the canonical density operator ọg as the state of 
greatest importance for macroscopic observations. 


5.2 Effect of External Perturbations 


In Sect. 5.1.2, I introduced the set of operators A’ as mere observables. Let us now 
see what happens if we allow them to actually affect the dynamics of the system. To 
that end, I couple each A’ to an external field A; and perturb the Hamiltonian of the 
system by a term linear in both, 


H()=H—j,A'. (5.13) 
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In the case of a spin-1/2 particle discussed in Example 5.1, A’ can be chosen for 
instance as the components of the spin operator. The perturbations 4; can then be 
interpreted as an external magnetic field. In general, one can think of A; as a set 
of “chemical potentials” associated with the observables AŻ. In presence of the 
perturbations, the canonical density operator is deformed to 


1 
ZA) 


Qp. = ere. (5.14) 


and the modified partition function is Z(A) = tr e BAW — tr exp[—6(H — A, AŻ)]. 
The symmetry of the Hamiltonian H under the group G is reflected in the invariance 
of the partition function under the transformation à; > A;R)” į Of its variables. 
This follows from a simple manipulation, 


ZA) = tr [U (8) "OU (8)] = trexp[ - BU(g)' H@)U(s)] 
= trexp { — B[H —A;R(g)',A/]} = ZOR(g)) . (5.15) 


It is convenient to trade the partition function for the free energy F(A) by Z(A) = 
e PF). The free energy inherits the symmetry of the partition function under G- 
transformations of 4;. Moreover, it makes it easy to evaluate the statistical average 
of A’ in presence of the perturbations, 


. ; OF (A 
[A exp[- pH -ajA =- | (5.16) 


; 1 
(A'oga = ZO tr 


One of the distinguishing features of ferromagnets is that once magnetized by 
an external magnetic field, the macroscopic magnetization persists even after the 
external field is turned off. With this in mind, we will be particularly interested in the 
behavior of (A! Jog, When the volume V of the system is large and simultaneously 
i; is small. In other words, we want to take the double limit V —> oo and A; — 0. 
This is where interesting things start to happen. 


5.2.1 Taking the Thermodynamic Limit 


In a finite volume and for any nonzero temperature, the partition function, and thus 
the free energy, is analytic in A;. When all the 4; are small, one can therefore perform 
a Taylor expansion, 


F(A) = F0) + ea Ie OUP): (5.17) 
Ai tao 


But the linear term in the expansion is forbidden by the G-invariance of the free 
energy combined with the symmetrization lemma. Thus, in a finite volume and at 
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nonzero temperature, one has a well-defined limit 


lim (A'o, = 0. (5.18) 


This should not be particularly surprising. 

How does the behavior of free energy as a function of A; change when we 
take the limit of infinite volume or zero temperature? Some common mathematical 
features of these two limits may be discussed jointly. It is however the large-volume 
(thermodynamic) limit that is responsible for the existence of macroscopically 
distinct phases of matter. Only in special cases may the zero-temperature limit 
lead to nontrivial physics even in a finite volume; I will give an explicit example 
below. The general discussion of the effects of external perturbations will however 
be phrased in terms of the limit V — oo. The temperature will be assumed to be 
fixed and nonzero unless explicitly stated otherwise. 

It is convenient to trade the free energy for its density, F(A) = F(A)/V, which 
is expected to be well-defined for any volume, including the limit V — on. It is 
sensible to assume that even in this limit, the free energy density remains continuous 
as a function of 4;. However, the analyticity may be lost. With regard to the behavior 
of the average of A’ around the point A; = 0, there are three possible scenarios: 


e The free energy density remains analytic in the limit V — oo. This automatically 
guarantees the validity of (5.18). There is no SSB once the external field À; is 
turned off. 

e The free energy density becomes nonanalytic in the limit V — ov, yet (5.18) 
remains valid. This is an interesting borderline case that is relevant for the physics 
of low-dimensional systems. 

e The free energy density becomes nonanalytic in the limit V —> oo. The average 
(A! Jog remains nonzero even if the limit A; — 0 is subsequently taken. This is 
the paradigm of SSB, illustrated by the right panel of Fig. 5.2. 


y” 
y” 


Fig. 5.2 Schematic illustration of scenarios in which the free energy density F is (left panel) and 
is not (right panel) analytic at à; = O. In the latter case, the observables A’ may retain a nonzero 
average even in the limit à — 0. Its value may however depend on the way the limit is taken 
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An alert reader might wonder why I focused on possible nonanalytic behavior 
of F at à; = 0, yet tacitly assumed that analyticity survives the thermody- 
namic limit for nonzero A;. This is intimately related to the degeneracy of the 
ground (equilibrium) state. What I really assumed was that adding the external 
fields A; breaks the symmetry of the system sufficiently so as to make the 
ground state unique. I will further elaborate on this assumption in Sect. 5.2.2. 
The point is that the free energy density typically becomes nonanalytic as 
a consequence of a competition of degenerate ground states with different 
values of (A! Josx: This is something one expects only at the G-invariant point 
Ai = 0. 


It is instructive to work out at least one concrete example to verify our prediction 
how SSB may emerge when a particular limit of the equilibrium state is taken. 
Ideally, this example should be exactly solvable in order to leave no doubt about the 
validity of the conclusions. It is therefore time to meet the Ising model, the most 
well-known and best-understood model of ferromagnetism in statistical physics. 
The price for all its benefits is that the limit in which the free energy density becomes 
nonanalytic is realized by taking the temperature to zero; the V —> oo limit itself is 
nearly trivial. I will largely follow the treatment in Sect. 2.2 of [9]. 


Example 5.3 
Consider a one-dimensional chain of N spin variables o;, allowed to take values +1. 
The Hamiltonian is given by 
N N 
H=-J X oioi — 1) -B9 o, (5.19) 
i=l i=l 


where oy+1 is identified with o; to ensure a periodic boundary condition. The 
(positive) coupling J represents interaction of nearest-neighbor spins, and B an 
external magnetic field. In the absence of the external field, the Hamiltonian (5.19) 
has a G ~ Z2 symmetry under the spin flip o; > —o;. 

The partition function of the model is easily calculable thanks to the fact that the 
exponential of a sum equals a product of exponentials. Thus, the contribution of a 
spin configuration {oi} , to the partition function equals Te V (oj, 0141), where 


/ / BB / 
V(o,o') = exp [sreo —1)+ (o +0 | : (5.20) 
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The latter can be thought of as a matrix element of the so-called transfer matrix, 
BB ,-2BJ 
e e 
V= : 5.21 

(a) (5.21) 
With the periodic boundary condition on the spins, the partition function of the Ising 
model is now Z = tr VN = aN +A where à+ = cosh BB + v'sinh? BB + e747 


are the eigenvalues of the transfer matrix. The average spin at each site of the chain 
is obtained by applying (5.16) to F (B) = F(B)/N, 


sinh BB AN AN nc sinh BB 
> 


N N i 
J sinh? BB + e467 A+ tÀ- y sinh? BB + e7487 
(5.22) 


(Fi) og. = 


The total number of spins N plays the role of volume, and sending it to infinity 
amounts to the thermodynamic limit. 

It is obvious from (5.22) that the Ising model does not feature SSB at any nonzero 
temperature. Indeed, whether N is finite or not, the average spin necessarily vanishes 
in the limit B — O whenever the temperature is nonzero. On the other hand, taking 
the temperature to zero gives 


lim (oi)og g = sgnB , (5.23) 
B—>œ ` 


again regardless of whether N is finite or not. Taking subsequently a (one-sided) 
limit of vanishing magnetic field will render the average spin nonzero, with a sign 
aligned with that of the magnetic field. This can be traced to the properties of the 
canonical density operator, which in the zero-temperature limit becomes a projector 
to the subspace of states with lowest energy. For the Ising Hamiltonian (5.19), 


B>0: lim og p=|++---)(t4+--+1, 

Booo 
1 

B=0: li == bat sas Se E E 
jim, 08.8 FE (+++ )( |) 

B<0: lim gg =|-----)(----1, (5.24) 
Booo 

where |+ +---) and |— —---) denote respectively the normalized states with all 


spins being positive and negative. Note that the Z2-symmetric state on the second 
line of (5.24) is unstable: an infinitesimally weak external field will project it onto 
one of the two fully polarized states. 


The example demonstrates that taking an appropriate (infinite-volume or zero- 
temperature) limit in the presence of a perturbation may lead to an equilibrium state 


72 5 Spontaneous Symmetry Breaking 


violating the symmetry of the system. The system then remains in an asymmetric 
state even if the perturbation is subsequently removed. It is very important that 
the limits of infinite volume and vanishing external field are performed in this 
order. Indeed, taking 4; — 0 first would delete the average of any symmetry- 
breaking observable in line with (5.18). Colloquially, one often speaks of the 
“noncommutativity of limits,” 


lim lim (A')o,, # lim lim(A')o,, » (5.25) 


4-0 V>co V>oia->0 


as a smoking gun of SSB. 

This concludes our excursion to thermodynamics. We have seen why asymmetric 
macroscopic states of matter can—and why symmetric macroscopic states may 
not—be stable. Identifying a (meta)stable equilibrium is a necessary first step if 
we ever want to understand the spectrum of a quantum system. Our next goal is 
therefore to establish a general operational procedure for finding a stable ground 
state. 


5.2.2 Order Parameter and the Vacuum Manifold 


In the Ising model, it was enough to distinguish the two possible asymmetric 
ground states by the sign of the external field. This is eventually because the model 
possesses a discrete Z2 symmetry. In systems with continuous symmetry we expect 
a continuum of degenerate equilibrium states. An already familiar example is the 
isotropic ferromagnet, where the spontaneous magnetization can take an arbitrary 
direction in space. Thus, we must be careful when taking the limit of vanishing 
external fields. 

Suppose that we are given a specific set of fields, Îi. I will define the limit of 
vanishing perturbations by changing the magnitude but keeping the “direction” 
of these fields. This can be implemented precisely by using a positive scaling 
parameter €. We can thus construct a macroscopic thermal state 9°. by first taking 
the thermodynamic limit and then removing the external fields via ° 


Oni = lim lim gei- (5.26) 


e>0+ V>oo 


In presence of SSB, the average 


co = lim lim (AŻ), . 
ead ae ere Opi 


(5.27) 


is expected to be nonzero. This is an important proxy of SSB, known as the order 
parameter. It does exactly what its name suggests: parameterize the appearance of 
order in the thermodynamic state of matter described by the density operator QP. 
Is the state (5.26) stable? This question is closely related to the choice of 
observables A’. Namely, the set A’ should be “complete” in the sense that the values 
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of the order parameters (A! )o~, determine the thermal state @°°. uniquely. In the 
limit of zero temperature, thigh cans that oF . Should be a pure state, analogously 
to the B Æ 0 states in (5.24) for the Ising model. 


Example 5.4 


In the Ising model, the role of the order parameter is played by the average spin (07). 
According to (5.23), in the zero-temperature limit in the presence of a nonzero 
magnetic field, (o;) acquires one of two possible values. These uniquely specify 
the corresponding symmetry-breaking pure ground states via (5.24). 

In an isotropic ferromagnet, the order parameter can be taken as the average 
of the operator of total spin (magnetization) S. Specifying the vector (S) is then 
sufficient to determine the stable macroscopic equilibrium uniquely. This is however 
not necessarily the only possible choice of order parameter. Instead of the three 
components of the vector S;, we might try to consider for instance the traceless 
symmetric tensor 7;; = S; Sj; + Sj Si — (2/3)ôij S?. This set of operators transforms 
under a nontrivial irreducible representation of the spin group G ~ SU(2), hence it 
satisfies the requirements we have imposed on the operators AŻ. The average (Tij) 
however, does not specify a unique equilibrium state. Namely, this order parameter 
cannot distinguish between the two states that only differ by the overall sign of (S). 


The order parameter is usually much easier to determine, or at least estimate, 
than the actual equilibrium state, 9°°.. Its real value lies in the fact that it carries 
the same information about the symmetry of the equilibrium state as the density 
operator One itself. To see this, just note that 


(Auge ugt = tU UA] =RG) (Ao . (5.28) 


This shows that for any g € G that is a symmetry of One the matrix R(g) leaves 
the order parameter unchanged. But the opposite is true as well. For any g € G such 
that R(g)' (AJ) 920. = (A!) o~, , the assumed completeness of the set of operators 
A! guarantees that ‘the states foe 3 and U (g)o%. U (g)" are the same. The pattern of 
SSB in a macroscopic equilibrium state is therefore completely determined by the 
associated order parameter(s). 

Finally, let me briefly return to the correspondence between the order parameter 
and the pure ground state (vacuum) obtained by taking the zero-temperature 
limit of (5.26). It is in principle possible that the physical system possesses two 
completely unrelated ground states of equal energy. (This happens for instance at 
a first-order phase transition.) Barring such accidental degeneracy, however, one 
expects the possible candidate vacua to be mutually related by (broken) symmetry 
transformations. It is therefore sufficient to know just one of the vacuum states; it 
should be possible to reconstruct all the others by the action of the operators U(g), 
g € G. We will see in Sect. 5.3 that this procedure is subtle in the limit of infinite 
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volume. What one can however do safely is to trade the chosen vacuum state ọ 
for the corresponding order parameter (A! )g- All other possible values of the order 
parameter can then be generated by acting on (A‘)» with the matrices R(g). 

In case G is a Lie group, the possible vacuum values of the order parameter span 
a manifold of dimension dim G — dim Hy; see Appendix A.1 for the mathematical 
background. This is known as the vacuum manifold. The central tenet of this book 
is that the physics of systems with spontaneously broken continuous symmetry can 
be captured by a low-energy effective field theory (EFT). The form of this EFT 
is largely fixed by the geometry of the vacuum manifold. The latter can in turn 
be understood in terms of G and its unbroken subgroup Hy. To develop the EFT 
framework based on the pattern of SSB is the goal of Parts III and IV of the book. 


Example 5.5 


Quantum chromodynamics (QCD) is a gauge theory of the strong nuclear inter- 
action. Strongly-interacting matter is represented by the quark field Y”. This is a 
Dirac spinor where œ is an index of the fundamental representation of the SU(3) 
gauge group of QCD. The index i represents quark flavor and belongs to the 
fundamental representation of SU (n¢). In applications, the number of relevant (light) 
quark flavors nt is usually two or three. 

In the limit of vanishing quark masses, QCD possesses a G ~ SU(nf)L x 
SU(nf)r symmetry, consisting of independent flavor transformations of the left- 
and right-handed components of W”. In the ground state of QCD, this symmetry 
is spontaneously broken by the complex order parameter (%;;) = (Sap Wi VfR). 
Note that tr(£ + 47) is the usual mass term for a Dirac spinor, here with equal 
masses for all quark flavors. We can therefore think of “switching on” small equal 
quark masses as an analog of the external field A; that selects a particular vacuum 
state. In this vacuum state, (X;;) = o64;;, where the constant ø is an intrinsic 
scale of QCD. The unbroken, “vector” subgroup of G, H ~ SU(n¢)v, consists of 
identical unitary transformations of left- and right-handed quarks. Broken symmetry 
transformations convert this vacuum to other vacuum states, in which (2j;)/o is 
unitary. The vacuum manifold is therefore diffeomorphic to the Lie group SU(m). 


5.2.3 Intermediate Summary 


We have taken quite a conceptual tour to arrive at the notions of order parameter and 
vacuum manifold, essential for understanding the physics of SSB. Let us therefore 
make a little break and review what we have done in this chapter so far. 

I started Sect. 5.1 with a basic definition of a state of a quantum system in terms of 
a density operator, and of its symmetry. Section 5.1.1 makes the distinction between 
broken and unbroken symmetries, and highlights energetic degeneracy of different 
states as a universal feature of SSB. A naive way out of the puzzle this poses 
might be to consider only states respecting the full symmetry G of the system. In 
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Sect. 5.1.2, I motivated the consideration of such G-symmetric states by recalling 
the thermal (canonical) density operator. The latter however renders the average 
of any observable breaking the symmetry zero. This is in a striking contradiction 
with the existence of asymmetric, macroscopically stable states of matter in nature. 
A careful consideration of the effects of small external perturbations in Sect. 5.2.1 
shows that these may drive the system towards a symmetry-breaking state. In fact, in 
the large-volume limit, even an infinitesimally small perturbation may be sufficient 
to destabilize the G-invariant thermal state. 

For most applications, it is mandatory to build a physical description of a given 
system upon a stable ground state. Such vacuum states are constructed in Sect. 5.2.2 
by taking the thermodynamic limit in presence of a perturbation, before the latter 
may be switched off. The different, degenerate vacuum states can now be labeled 
uniquely by different values of an order parameter, corresponding to the average of 
a suitably chosen set of observables. The choice of the order parameter itself is not 
unique. Depending on this choice, the same vacuum state may be represented by a 
point in different order parameter spaces. However, the set of all degenerate vacuum 
states spans a vacuum manifold, whose geometric structure is determined solely by 
the symmetry group G and its unbroken subgroup. 

This completes the background needed to understand the following chapters. 
When talking about a vacuum or ground state, I will always implicitly have in mind 
a pure state, obtained by the limiting procedure in (5.26). Also, I will from now on 
always assume vanishing thermodynamic temperature. Nonzero temperature was 
only used in this chapter as a tool to avoid singularities in the partition function. 

The story of SSB, however, does not end here. A closer look reveals a number 
of intriguing aspects that I have skipped so far in the desire to provide a minimal 
self-contained introduction to SSB. While not of direct relevance for the rest of the 
book, it would be a pity to omit these aspects altogether. I therefore mention some 
of them at least briefly in the next section, in the form of a case study. 
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In the previous section, the thermodynamic limit played a crucial role in selecting 
a unique symmetry-breaking ground state. This begs the question of what one can 
expect in volumes that are large yet finite. After all, real macroscopic systems in 
nature certainly are finite, albeit possibly large from the point of view of microscopic 
physics. To what extent are then symmetry-breaking states stabilized by an external 
perturbation? Besides, how does the finite-volume quantum ground state look in the 
absence of perturbations? 

These are some of the questions I will address here. I am not aware of any formal 
framework that would allow one to tackle these questions in full generality. In fact, 
the answer to some of them may depend on the specific physical system. Instead 
of trying to be general, I will therefore work out in detail one concrete example, 
just to illustrate what is at stake. Some rigorous results on the spectrum of quantum 
systems with SSB in a finite volume can be found in [10] or in Part I of [11]. 
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5.3.1 Free Schrödinger Field in Finite Volume 


Following Sect. 3 of [12], consider the theory of a free Schrödinger field y, defined 
by the Lagrangian density 


L=iw' dv — vy’ Vv, (5.29) 


where m is a mass parameter. I will quantize this theory in a d-dimensional box of 
size L, with volume V = Lf, endowed with a periodic boundary condition. The 
boundary condition admits only discrete values of momentum p, given component- 
wise by 


2 Ny 
Pr — L 


> n EZ. (5.30) 


The field operator can be Fourier-expanded in terms of plane-wave solutions to the 
Schrödinger equation. In the Schrödinger picture, 


1 , 
v(x) = W 3 ayer. (5.31) 


where ap is the annihilation operator for a single-particle state of momentum 
p. This is normalized so that together with the creation operator a}, it satisfies 
lap, aq] = ôpq. The normalization of the plane-wave expansion by the factor 1/ JV 
then guarantees that the canonical coordinate y(x) and its conjugate momentum 
iy’ (x) obey the canonical commutation relation [Y (x), iyt (y)] = iô? (x — y). 

Within second quantization, all these operators act on a Hilbert space, built using 
the Fock construction. The starting point is the Fock vacuum |0), defined by the 
condition ap |0) = O for all allowed values of p. An orthogonal basis of the space 
is then obtained by acting on |0) with a finite number of creation operators in all 
possible ways. The Schrödinger Hamiltonian has a simple expression in terms of 
the annihilation and creation operators, 


1 p? 
Hac J d% Vi (x) - Vy(x) = 2 5 opt : (5.32) 


This is a positive-semidefinite operator. The lowest (zero) energy is reached by any 
state |Q) in the Hilbert space that satisfies 
dp |2)=0 forall p40 (ground state) . (5.33) 


There are infinitely many such states in the Hilbert space, in particular any state 
obtained from |0) using the zero-momentum creation operator aj. Hence, the 
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subspace of states of zero energy is identical to the Fock space of a linear harmonic 
oscillator. 

The infinite degeneracy of the ground state hints at SSB. To see what symmetry 
might be spontaneously broken here, let us think what conserved charges there are 
in the first place. There turn out to be infinitely many of them. One can take for 
instance any operator of the form 


Qs =} f(patay = | d% wi (x) f(-iV) W(x) , (5.34) 
P 


where f is assumed to be a real analytic function. This class of operators includes 
the Hamiltonian itself, with f(p) = p’/(2m), the momentum operator P = 
a papap, and the operator of particle number Q1 = X > aja p- From (5.34) it is 
obvious that all the operators Q ¢ mutually commute with each other, hence all are 
conserved. By (4.34), they induce a local transformation of w(x) with parameter €, 
òy (x) = —ie f(—iV) w(x). In addition to the class (5.34), any Hermitian operator 
constructed out of ag and a commutes with the Hamiltonian, and thus represents a 
conserved charge as well. Not all of these, however, descend from a local Noether 
current. Two notable exceptions are the two possible Hermitian operators linear in 
ag and a . I will list them alongside the symmetry transformations they generate: 


On =i f ay -viO = VV a) y> y-e, 
(5.35) 
Q= fas (Wie) + yi] = VV ao ag) , Yy —>y-—ie. 


With all the symmetry transformations at hand, the associated Noether currents can 
be extracted from the Lagrangian (5.29) using the technique outlined in Sect. 4.2.1. 
The Lie algebra of our conserved charges is defined by the commutators 


[Q r, Or] = ~if (0) Q1, [Of, Qr] = +i f 0) QR , [Or, Qr] = 2iV . 
(5.36) 


These make it natural to split the space of charges Q p into the one-dimensional 
subspace of charges proportional to Q1, and a subspace of charges with f (0) = 
0. The latter commute with Qp, Qr and moreover annihilate any ground state 
I2} as defined by (5.33). We can therefore drop them and focus on the set 
Q1, Or, Qr. By (5.36), this set furnishes a central extension of the Lie algebra 
iso(2), corresponding to Euclidean transformations in the complex plane of y. 
The generator Qı induces (phase) rotations, whereas Or, Qy generate respectively 
translations of the real and imaginary parts of y. 

The Fock vacuum |0) is annihilated by Q1, but not by Qr, Q1. This suggests that 
alternative vacuum states may be obtained from |0) by transformations generated by 
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Or, Q1. We can pick any z = z; + iz2 € C and define a new vacuum state by 
|z) = exp li OR + z201)| |0) = exp [VV ai — z*ao)] 10) . (5.37) 


This is a coherent state in the Fock space of zero-energy states. Indeed, it is easily 
seen to be an eigenstate of the annihilation operator, ao |z) = <Vz|z). In fact, 
|z) is simultaneously an eigenstate of the field operator at any point in space, 
w(x) |z) = z|z). We thus end up with a continuum of vacuum states, labeled by 
a complex parameter z. The order parameter distinguishing these states from each 
other is provided by the expectation value of y(x). The vacuum manifold is the 
complex plane C. For any choice of z, the state |z) possesses an unbroken SO(2) 
symmetry, corresponding to rotations of the complex plane around the point z. Its 
generator is Q1 = Q1 +z2 Qr —z1 Q1. The state |z) is an eigenstate of this operator, 
21 |z) = —V |z|*|z). 

Interestingly, it is not possible to select the state |z) by adding to the Hamiltonian 
a perturbation linear in yw as I did in Sect. 5.2. That would make the Hamiltonian 
unbounded from below. This is the price to pay for the vacuum manifold not being 
compact. An alternative is to add to the Hamiltonian a chemical potential u for the 
conserved charge * Q), so that it becomes H, = H — u? Q1. The expectation value 
of such a perturbed Hamiltonian in the state lz! ) with any z’ € Cis 


2 

(z| Hy |z) = -Vaf le -= zl = Iz]. (5.38) 
An infinitesimally small negative u will then select |z) as the unique ground state 
with energy V u [z|*. 
5.3.2 Pathologies of the Infinite-Volume Limit 
The analysis of the free Schrödinger theory has been perfectly clean so far. Most of 
the properties of the quantized theory match their classical counterparts, in particular 
the existence of infinitely many conservation laws, and infinitely many degenerate 


ground states. What happens when we try to take the limit of infinite volume? Let 
us evaluate the overlap of two different coherent states of the type (5.37), 


Kill? = uolo? = exp (- Y jz- z?) 
U (z) = exp [i(zı Qr + 220)] . (5.39) 
We therefore find that 


. D 2 
jim Mele) = fim [lUo] = zz . (5.40) 
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In the limit V — ov, any two states |z) and |z } with z Æ z’ become orthogonal. 
It is easy to generalize the above calculation to show that, in fact, any vector in the 
Fock space built above |z) must be orthogonal to |z } in this limit.! In other words, 
the ket-vector |z } does not lie in the Fock space built above |z) at all. Moreover, the 
matrix element of any observable constructed using a finite number of annihilation 
and creation operators between the |z) and |z } states vanishes. Altogether, the states 
|z) with z € C therefore give rise to a completely disconnected Fock space each. 
Transitions between Fock spaces with different vacuum states are forbidden. This is 
certainly not what we expected! 

This singular behavior of the V — oo limit is easy to understand. Just remember 
that |z) is an eigenstate of ag with the eigenvalue /V z. Thus, if we insist on keeping 
z fixed, then scaling V towards infinity makes the eigenvalue diverge as well. This 
is ultimately the reason why |z } does not lie in the Fock space built above |z). 
We might, on the other hand, try to keep the eigenvalue of ag fixed, which would 
require scaling z as 1/./V as the volume grows. Roughly speaking, this means that 
in a large but finite volume V, transitions between Fock spaces with different z are 
allowed, but only within a range |z =z | <S1/VV. 

There is another way of looking at this, emphasizing the properties of the 
operator U (z) defined by (5.39). Since the states |0} and |’ } do not lie in the 
same Fock space, the operator U (z) becomes ill-defined in the limit V —> oo. 
Spontaneously broken symmetry may not be realized by unitary operators on 
the Hilbert space of the physical system! That however does not mean that in 
presence of SSB, it makes no sense to speak of symmetry transformations that are 
spontaneously broken. The transformation of the field operator by U (z), 


UWD YUR) = W(x) +z, (5.41) 


is perfectly well-defined even in the limit V —> oo. The same applies to any 
local operator built out of a finite number of y and yÏ. In each of the Fock 
spaces built above any |z), we thus have a well-defined notion of broken symmetry 
transformations in terms of averages of observables. We just have to be careful not 
to think of broken symmetry as an operator connecting different states in the same 
space. 


The distinction between the realizations of broken symmetries by operators 
on states and by transformations of observables is not a mere mathematical 
curiosity one can safely ignore. It has observable consequences. One of 
the earliest applications of group theory to physics was to classify energy 


(continued) 


' Recall that the basis of the Fock space consists of states obtained by acting with a finite number 
of creation operators on the vacuum. 
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levels of a quantum system in terms of representations of its symmetry 
group. In quantum mechanics, the so-called Wigner theorem guarantees that 
any symmetry is represented by a unitary or antiunitary operator on the 
Hilbert space (see Chap. 2 of [13]). Energy levels are therefore classified by 
irreducible representations of the symmetry group G. In quantum field theory, 
on the other hand, only the unbroken subgroup H is as a rule realized by 
unitary operators on the Hilbert space. Thus, energy levels form irreducible 
multiplets of H. SSB implies degeneracy of the ground state, accompanied 
by a reduced degeneracy of the excitation spectrum. 

For example, the spectrum of hadrons is organized into (approximately 
degenerate) multiplets of the isospin group SU(2). We know from Exam- 
ple 5.5 that this is the unbroken subgroup of QCD with two light quark 
flavors. There is an even higher degeneracy, corresponding to multiplets of 
SU(3). This is however less accurate due to the large splitting between the 
masses of the up, down and strange quarks. There are no multiplets of the 
SU(np)L x SU(nf)r symmetry of QCD. Such multiplets would be easily 
noticeable. For instance, the pseudoscalar mesons such as pions or kaons 
would have to appear in the spectrum together with particles of opposite 
parity. The lack of such particles is a directly observable manifestation 
of the nonexistence of unitary operators representing spontaneously broken 
symmetry. 


5.3.3 Uniqueness of the Finite-Volume Ground State 


The above excursion into the subtleties of SSB was purposefully limited to a single, 
exactly solvable example. The conclusion that in infinite volume, spontaneously 
broken symmetry is not realized by unitary operators on the Hilbert space is, 
however, general [14]. Likewise, it is generally true that different symmetry- 
breaking vacuum states belong to disconnected Hilbert spaces. 

One property of the free Schrödinger theory (5.29) that is not general is the 
existence of exactly degenerate ground states already in a finite volume. This is 
typical for systems where the order parameter is constructed using an operator that 
commutes with the Hamiltonian. In our case, this operator can be taken as the spatial 
average of the Schrédinger field, 


v= > fa w(x) = —a . (5.42) 


Another system of this type is the ferromagnet, where the order parameter is sup- 
plied by a selected component of total spin. Historically, the very special properties 
of such systems led to a controversy (involving some famous individuals [15]) as to 
whether they should count as SSB at all or not. 
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An alternative is that in any finite volume, the system possesses a unique ground 
state, which is then necessarily G-invariant. Above this ground state, there is a tower 
of closely spaced energy levels, which all become exactly degenerate in the limit 
V — oo. A notable, much studied example of such a system are antiferromagnets. 
The realization of SSB is then quite nontrivial. Although a detailed discussion would 
be beyond the scope of this book, let me make at least a few remarks. The reader is 
encouraged to consult Sect. 19.1 of [16] or Sect. 2 of [1] for further information. 

If one tries to take the naive limit V — oo, the unique, finite-volume G-invariant 
ground state turns into a likewise G-invariant state in infinite volume. The latter is, 
however, unstable under infinitesimally small external perturbations. The instability 
has its precursor already in a finite volume. Namely, the magnitude of a perturbation 
needed to drive the system away from the G-invariant vacuum scales as the inverse 
of the system’s volume. Such a perturbation may be supplied in particular by the 
measurement of any symmetry-breaking observable. This makes the system relax 
into one of the pure symmetry-breaking states constructed in Sect. 5.2.2. 

Another way to understand the distinction between the various would-be ground 
states is through correlations of spatially separated observables. For any two local 
observables A(x) and B(x), one may construct the spatial correlation function 
(A(x) B(y)); the angular brackets indicate expectation value in a chosen ground 
state. The so-called cluster decomposition principle dictates that 


lim [(A(@®)B(y)) — (Aœ) (B(y))] =0. (5.43) 


|x—y|> 00 


It encodes the fundamental requirement that the outcomes of distant measurements 
should not be correlated with each other. The naive G-invariant ground state 
obtained by taking the V — oo limit violates this principle, and thus displays long- 
range entanglement. The entanglement is destroyed by any measurement of a local 
symmetry-breaking observable, which brings the system to a stable, symmetry- 
breaking ground state satisfying (5.43). 

To illustrate some of the above general observations, I will conclude the 
discussion of SSB by working out one more simple toy model. We return to the 
free Schrédinger theory, but twist the boundary condition to 


w(x +e, L) =e wx). (5.44) 


Here e, is a unit vector along the r-th Cartesian coordinate axis, and 0, € 
(—z, +71] is an arbitrarily chosen phase. I assume that at least one of 6, is nonzero, 
otherwise (5.44) would be just the periodic boundary condition we used previously. 

The quantization of the theory proceeds as before, except for the allowed values 
of momenta, where (5.30) is replaced with 


7 6, 2an, 
L L 


Pr 7 n EZ. (5.45) 
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Fig. 5.3 Schematic 
visualization of the spectrum 
of the free Schrédinger theory P 1 x n-=(1,0,0,...) 
with the twisted boundary ° : 
condition (5.44), for small and permutations 
(positive) 6,. Each of the 
indicated groups of energy 
levels consists of states with 
multiple quanta of E 1 x n-=(-1,0,0,...) 
momentum 0/L; the number Š 
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and permutations 


of such quanta is denoted as n=2 

n. The lowest-lying multiplet n=1 

includes no other particles. n=0 

The higher multiplets include 

one or more particles where 

some of the n, in (5.45) are 

nonzero. For simplicity, all 

the 6, are assumed to be 

equal; otherwise the spectrum 

would feature additional : 

structure e only n,=(0,0,0,...) 
n=2 ——~ 
n=1 
n=0 


The Hamiltonian (5.32) now has a unique ground state, namely the Fock vacuum 
|0). The lowest-lying excitation of this ground state corresponds to n, = 0 for all 
r, that is, p = 0/L. This is a one-particle state |0/L) = aye |0) with excitation 
energy 


02 


——. 5.46 
2m L2 ( ) 


Foj_ = 
As long as all the 6, are sufficiently small, the low end of the excitation spectrum 
will consist entirely of states with n € N quanta of momentum 0/L and excitation 
energy nEg/z. Only above these will one find states for which some of the n, are 
nonzero. See Fig. 5.3 for a visualization of the spectrum of energy levels. 
How is the Fock vacuum |0) affected by perturbations? Since the p = 0 state 
is excluded by the boundary condition, we can now add a perturbation linear in the 
field without running into an instability. Let us therefore add to (5.29) the term 


Liy, xx) = S yaan 4 Vl werent (5.47) 


where € is a small constant parameter with the dimension of energy. The expo- 
nential, coordinate-dependent factors are added in order that % itself satisfies 
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a periodic boundary condition. The corresponding correction to the Hamilto- 
nian (5.32) is 


H: = —(€*aoj1 + eaj) , (5.48) 


This perturbation will change the energy eigenstates within the multiplets shown 
in Fig. 5.3. It will however not lead to any mixing between states from different 
multiplets, that is states with different n,. Let us focus on the lowest-lying multiplet, 
which is where the ground state resides. Here the Hamiltonian is projected down to 


Ho/L = E9144 /1,40/L — (e*agjL + €44 1) 
= EgjL[ (a$; — 22)(a0/1. — Ze) — lzel? ] , (5.49) 


where ze = €/Eg/z,. There is still a unique ground state, but it is now the coherent 
state |z-). The perturbation has lowered the energy compared to the Fock vacuum 
|0) to —Egyz |zel? = — lel? /Eoyz. 

Let us see what we have found. The original vacuum |0) is nondegenerate. Its 
separation from the nearest excited states is however only of the order of Eo; ~ 
1/L?. In the limit of infinite volume, all the different multiplets of states shown 
in Fig.5.3 become degenerate. Accordingly, the Fock vacuum becomes unstable 
under small perturbations. For small fixed 6,, one can always find e that satisfies the 
bounds 


K —. (5.50) 


The lower bound makes the change in energy caused by the perturbation much larger 
than the level spacing within a given multiplet. Thus, even a small perturbation can 
drive the ground state from |0} to a coherent state |ze} with large ze. The upper bound 
in (5.50) ensures that the shift of energy eigenvalues induced by the perturbation is 
much smaller than the separation of the different multiplets in the spectrum. 


The free Schrödinger theory with the twisted boundary condition (5.44) serves 
as a good example of the instability of the finite-volume ground state with 
respect to small perturbations. Likewise, it provides a decent illustration 
of the structure of the spectrum of systems with SSB in a finite volume. 
One important feature that it does not illustrate is the G-invariance of the 
finite-volume ground state. This is because the symmetries (5.35) of the free 
Schrödinger theory are broken by the boundary condition (5.44) for any 
nonzero 6,. In fact, insofar as the expectation value (y(x)) is well-defined, 
shift symmetries of the type (5.35) must always be spontaneously broken. 
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In Chaps. 2 and 3, we saw examples of how spontaneous symmetry breaking (SSB) 
gives rise to massless particles in the spectrum. Our operational understanding of 
these Nambu-—Goldstone (NG) bosons was based on a classical (tree-level) analysis 
of certain scalar field potentials. The purpose of this chapter is to develop a 
general understanding of NG bosons without the limitations of a particular model 
or approximation. The basic intuition is built in Sect. 6.1. This explains both where 
NG bosons come from, and how their spectrum is related to the nature of broken 
symmetry. The subsequent sections put this intuition on a solid footing. In Sect. 6.2 
I prove the so-called Goldstone theorem, which asserts the very existence of a NG 
boson as a consequence of SSB. Section 6.3 then delves into the question how 
many NG bosons there are in a given system, and how their number relates to 
their dispersion relations. This generalizes the observations made in Chap. 3; see 
Sect. 3.3 for a quick summary. Altogether, this chapter completes the background 
needed in Parts III and IV, where the effective field theory (EFT) formalism for SSB 
is developed in detail. 


6.1 Intuitive Picture 


The intuitive understanding of NG bosons relies heavily on the concept of order 
parameter. Consider for simplicity a system whose ground state is spatially uniform, 
and imagine that we disturb it by a weak, local perturbation. Locally, it is 
energetically favorable for the system to remain in one of the degenerate ground 
states. The order parameter will therefore respond to the perturbation by developing 
spatial variation while remaining on the vacuum manifold everywhere. The energy 
cost of creating such a spatially varying excited state must be proportional to 
gradients of the order parameter. This is because changing the order parameter 
uniformly just amounts to a different choice of ground state. We can then imagine 
order parameter configurations that vary over progressively longer and longer length 
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scales. In the limit that the scale of spatial variation goes to infinity, the gradient of 
the order parameter vanishes, and the energy cost must go to zero. 

This is a completely general observation, valid regardless of the choice of field 
variables and dynamics behind SSB. The only assumption that is really necessary is 
that the spontaneously broken symmetry is continuous. This makes smooth spatial 
variation of the order parameter possible while keeping it on the vacuum manifold 
everywhere. We conclude that NG bosons are local, propagating fluctuations of the 
order parameter whose energy goes to zero in the infinite-wavelength limit. 

The identification of NG bosons with fluctuations of the order parameter is not 
necessarily one-to-one. Eventually, we would like to know the precise spectrum 
of NG bosons; this among others governs the low-temperature thermodynamics of 
systems with SSB. What we really need in order to establish a dictionary between 
broken symmetries and NG bosons is to answer the following questions: 


e How many different types of order parameter fluctuations (NG fields) are there 
in a given system? 

e What is the correspondence between the various order parameter fluctuations 
(NG fields) and NG modes in the spectrum? 


I will deal with these two questions in the given order in the next two subsections. 


6.1.1 Redundancy of Order Parameter Fluctuations 


I will build upon the intuitive picture of a NG boson in terms of a fluctuation of 
the order parameter that locally remains on the vacuum manifold. One can imagine 
such a fluctuation as being generated from the ground state by a broken symmetry 
transformation with a coordinate-dependent parameter. This leads immediately to 
the important observation that fluctuations induced by different broken symmetry 
generators may coincide. 


Example 6.1 


The Lagrangian of a free massless relativistic scalar field ¢ is Z[@] = (1/ 2) (Iu). 
The action of this theory is invariant under the polynomial shift transformation 
o(x) > o(x)+€1+€2,x", where €1, €2,, are constant parameters. This symmetry is 
necessarily spontaneously broken, and the order parameter can be chosen as (#(x)). 
One might expect to obtain D + 1 different fluctuations of the order parameter by 
applying a polynomial shift with coordinate-dependent parameters €) (x), €24 (x). 
However, shifting @(x) by €24 (x)x” is identical to shifting it by €,(x) if we 
set €1 (x) = €2,(x)x". The set of D + 1 broken generators associated with our 
polynomial shift therefore corresponds to a single independent fluctuation, induced 
by d(x) > (x) + €1 (x). The only independent NG field in the theory is ¢ itself. 
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Symmetries that become equivalent once their parameters are made coordinate- 
dependent are called redundant. Such a redundancy is typical for spacetime 
symmetries. As Example 6.1 shows, however, it is also possible for point symme- 
tries that do not affect spacetime coordinates. Here is a less trivial example, relevant 
for any crystalline phase of matter. 


Example 6.2 


In relativistic theories of scalar fields, the generators of spacetime rotations J” 
and translations P” are known to satisfy the relation J’? = x*P” — x” P”. A 
local rotation exp[(i/2)€2,»(x)J""] with antisymmetric matrix parameter €2,,) is 
thus equivalent to a local translation exp[ie1,(x) P”] with eip (x) = —€2y (x)x”. 
Imagine now a system where spacetime rotations and translations (or a subset 
thereof) are both spontaneously broken. For example, in crystalline solids, all the 
continuous spatial rotations and translations are spontaneously broken. The rotations 
are clearly redundant; the only independent NG fields are those associated with 
broken translations. These parameterize the vibrations of the crystal lattice. 


The basic moral to remember is that the number of independent order parameter 
fluctuations may be lower than the number of broken symmetry generators. I will 
further refine this observation in Sect. 6.3.1. The discussion of redundancy of local 
symmetry transformations will be of great importance for the development of EFT 
for broken spacetime symmetries in Part IV. 


6.1.2 Canonical Conjugation of Nambu-Goldstone Fields 


The number of NG modes in the spectrum may be lower than the number of 
independent NG fields in case some of the latter are canonically conjugated. We 
saw this in Sect. 3.2. To make statements independent of a specific model, we need 
to relate the possibility of canonical conjugation of NG fields directly to the broken 
symmetry. 

Suppose we have two NG fields, m! and 2, associated with two broken 
symmetry generators Q; 7. We can assume without loss of generality that under 
the respective transformations generated by Q4 2, these fields transform as 


1 


nt > n" + e[t + O(z)], a=1,2. (6.1) 


Indeed, one can always choose the fields so that the ground state corresponds 
to x“ = 0. That the leading, constant piece of the Taylor expansion of the 
transformation rule for x° around this ground state is nonzero, follows from the 
assumption of broken symmetry. The bases of NG fields and broken generators can 
then be aligned so that this constant piece is simply 7“ —> m° + €*. 
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Suppose in addition that we can construct a low-energy EFT for the NG fields 
whose Lagrangian contains a term with a single time derivative, 


Loe = onton? +--+. (6.2) 


The ellipsis stands for terms with more than one derivative or more than two NG 
fields. Recall from Sect. 4.2.1 that the Noether currents corresponding to (6.1) can 
be identified by making the parameters €“ coordinate-dependent. From (6.2) alone, 
one can then extract the leading contributions to the Noether charge densities, 


Fix] = on? +- , Join] =+onn'+---. (6.3) 


The ellipses now represent terms with derivatives or more than one NG field. 

Here comes the key step: I will evaluate the transformation of JR [7x] under the 
symmetry generated by Q; in two different ways. On the one hand, it is obvious 
that under (6.1), JRI] > JS tx] + 012€! +--+. On the other hand, one may think 
of the currents as quantum operators and represent the same transformation as 


exp(ie! Q1) J? exp(—ie'Q1) = JP + ie'[Q1, JP] + OCh’) . (6.4) 


The NG fields vanish in the ground state. Hence, upon taking the vacuum expecta- 
tion value (VEV), a comparison of our two little calculations leads to 


012 = i([Q1, JP1) = —i(lQ2, JP) , (6.5) 


where the second equality follows by running the same argument on J H [7x] and Q2. 

This is a remarkable result that is as model-independent as it gets. In any theory 
that realizes the same symmetry-breaking pattern, one can construct the Noether 
currents via Noether’s theorem. Evaluating the VEV of the commutator (6.5) then 
gives us a simple criterion for when to expect two NG fields to be canonically 
conjugated: whenever (6.5) is nonzero. 


Example 6.3 


We already saw the relation between canonical conjugation and the charge com- 
mutator (6.5) at work in Sect. 5.3.1. Indeed, one can think of the free Schrödinger 
field y therein as corresponding to two real NG fields. These are in turn associated 
with the invariance of the free Schrödinger theory under independent shifts of the 
real and imaginary parts of y. The commutator of the generators of these shifts, 
QR and Qj, turns out to be nonzero upon quantization. It is easy to check that the 
normalization of the commutator as shown in (5.36) agrees with (6.5). The factor 
of spatial volume V in (5.36) arises from the latter displaying a commutator of two 
charges rather than a charge and a charge density. 
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Although noninteracting, the free Schrédinger theory is nontrivial in that it makes 
the VEV (6.5) arise from a central charge in the symmetry algebra. Let us have a 
look at one more example, which is mathematically simpler and physically familiar. 


Example 6.4 


In the ideal (that is spatially uniform and isotropic) approximation, both ferro- 
magnets and antiferromagnets possess a symmetry G ~ SU(2), generated by the 
operator S of total spin. In the ground state of both, this is spontaneously broken 
down to H œ U(1), consisting of spin rotations around the axis along which the 
spins are aligned. Suppose that one chooses the ground state oriented along the third 
direction in the spin space. Then H is generated by $3, whereas the two independent 
broken generators can be chosen as $1,2. 

The difference between ferro- and antiferromagnets is that the former feature a 
nonzero net magnetization. This amounts to a nonzero VEV ($3) = —i([S1, S2]). 
As a consequence, the two NG fields in a ferromagnet are canonically conjugated. 
The spectrum only contains a single NG mode: the ferromagnetic magnon. Its 
dispersion relation at long wavelengths is known to be quadratic in momentum. 
In antiferromagnets, on the other hand, (S) = 0. As a consequence, the spectrum of 
antiferromagnets features two different magnon branches. Their dispersion relation 
is known to be linear in momentum in the long-wavelength limit. 


6.1.3 The Big Picture 


Let me summarize what we have learned so far. Spontaneous breaking of a 
continuous symmetry implies the existence of a gapless mode in the spectrum: the 
NG boson. This can be viewed as a propagating fluctuation of the order parameter. 
The number of such NG modes may be lower than the number of broken symmetry 
generators. There are two basic mechanisms how such a reduction may occur. 
First, different broken symmetry generators may induce fluctuations of the order 
parameter that are indistinguishable from each other. I will refer to this mechanism 
as “geometric” to keep in mind that it represents a restriction on the configuration 
space of off-shell NG fields. Second, there may not be a one-to-one correspondence 
between the independent NG fields and the independent NG modes in the spectrum. 
This happens whenever a pair of NG fields is canonically conjugated, which in turn 
requires a nonzero VEV of the charge commutator (6.5). This mechanism I will 
refer to as “dynamical” since it restricts the space of on-shell fields, or eigenstates 
of the Hamiltonian. For the reader’s convenience, an outline of the correspondence 
between broken symmetry, NG fields and NG modes is shown graphically in 
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broken 
generators 


geometric reduction by 
redundancy of fluctuations 


fluctuations of 
order parameter 


dynamical reduction by 
canonical conjugation 


gapless modes 

in the spectrum 
Fig. 6.1 Basic mechanisms that may cause the number of independent NG modes in the spectrum 
to be lower than the number of broken symmetry generators. The geometric reduction arises when- 
ever fluctuations of the order parameter, induced by different generators, are indistinguishable from 


each other. The dynamical reduction is signaled by a nonzero VEV of the charge commutator (6.5), 
or by a term in the effective Lagrangian for NG fields with a single time derivative 


Fig. 6.1. This scheme is intuitively simple yet somewhat imprecise. I will return 
to the question of how many NG modes there are in a given system in Sect. 6.3. 


6.2 Goldstone Theorem 


The Goldstone theorem is one of the most profound exact results in quantum field 
theory that deserves a more careful justification than the intuitive but hand-waving 
argument of Sect. 6.1. In the next two subsections, I will give two different proofs, 
following in spirit the original work of Goldstone, Salam and Weinberg [1,2]. Along 
the way, I will clarify the technical assumptions on which the theorem is based. A 
reader seeking a higher level of mathematical rigor than what I can offer here is 
advised to consult the early review [3] or the book [4]. 


6.2.1 Operator Proof 


I will start with a proof using the operator formalism of quantum field theory in the 
Heisenberg picture. Before proceeding to the details of the proof, however, I need 
to spend some time listing its technical assumptions. 

First, I assume that in the limit of infinite spatial volume, the system possesses 
unbroken continuous (spacetime) translation invariance. This ensures the existence 
of a basis of the Hilbert space consisting of eigenstates of the energy-momentum 
operator P”. It is always possible, and I will implicitly do so, to define this 
operator so that the energy and momentum of the ground state are both zero. The 
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assumption of continuous spatial translation invariance is, in fact, unnecessarily 
strong. The minimum requirement is unbroken discrete translation invariance, 
which is necessary to have a well-defined notion of quasiparticles. The proof then 
proceeds along the same steps at the cost of a somewhat cluttered notation [5]. 

Second, I assume the existence of a local operator J°(x), interpreted as the 
density of a conserved charge. The charge density should not depend explicitly on 
spacetime coordinates; this is needed to ensure the translation property 


J9(x) = el * JO (Oe. (6.6) 


Now define the total charge contained in a finite spatial domain Q, 
Qot) = | d% J? (x,t). (6.7) 
Q 


The last assumption we need is the existence of a local, time-independent operator 
® such that the VEV in the chosen ground state |0), 


(0|[Qa(4), ®] |0) , (6.8) 


is nonzero and time-independent in the limit Q — oo. 

It is this last assumption that captures the essence of SSB. Think of Qg as the 
Noether charge associated with a continuous symmetry. One would be tempted to 
say that in the limit Q —> œ, Qg itself becomes time-independent. But we saw in 
Sect. 5.3 that in the infinite-volume limit, the operators representing spontaneously 
broken symmetry may be ill-defined. The limit Q — oo is only safe if performed 
on the commutator in (6.8), not on Qg itself. 


Usually, the time-independence of the integral charge is a consequence of 
a local conservation law, 3p J” = 0. That is however strictly speaking not 
necessary. Taking this additional step requires the vanishing of the surface 
integral Ne d¢x ð- J” (x,t) as Q — ov. This condition, or even its weaker 
form imposed only on the VEV (0| [0, J” (x, t), ®] |O), may be compromised 
in systems with long-range interactions [3]. Here I avoid having to deal with 
this issue simply by assuming right away the time-independence of (6.8). 


Think of (0| ®|0) as an order parameter. Then (6.8) is, up to a factor, its 
infinitesimal variation under the transformation ® — exp(ié Qo)® exp(—ie Qo). 
The assumption that (6.8) is nonzero guarantees that |0) cannot be an eigenstate of 
Qg. Hence the symmetry generated by Qo must be spontaneously broken. 

We are now ready to prove the existence of a NG mode in the system. I will 
initially assume that the system is enclosed in a finite volume V with a periodic 
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boundary condition. The Hilbert space then admits a basis of momentum eigenstates 
|n, p), where p stands for a set of discrete momenta consistent with the boundary 
condition. In a finite volume, the basis states can be normalized as (m, pln, q) = 
Ôômnôpq. The label n indicates all other quantum numbers the states may possess, 
such as relative momenta in multiparticle states, or internal degrees of freedom. 

Upon inserting the partition of unity in terms of |n, p) and using the translation 
property (6.6), the VEV (6.8) can be rewritten as 


Olla, ©] 10) =>) f d | exp(—ipn - x) (01 J’) In, p) (n, pl ® 10) 
n,p 


— exp(tipn +x) (01 ® In, p) (n, p1 ICO) 10) ] 
(6.9) 


Here p, is a shorthand notation for the energy-momentum of |n, p). Now recall that 
SSB implies the existence of degenerate ground states. Even if one picks a unique 
vacuum |0), the other, alternative ground states do not disappear. They contribute 
to (6.9) among the states |n, 0) with p = 0. Is this something to worry about? 

In the free Schrödinger theory, worked out in Sect. 5.3.1, (0| J? (0) |n, 0) scales 
as 1/4/V, cf. (5.35). This observation turns out to be general. With the periodic 
boundary condition, the integral charge Qy (t) is translationally invariant. A classic 
argument [6] then shows that the norm of Qy (t) |0) scales as V/V, 


|| Ov (t) 10) IP =f a (0| J°(x, t) Ov (£) 10) = V (0| J°(0) Qv (0) |0) . 
(6.10) 


This means, roughly, that the norm of J°(0)|0) scales as 1/./V, and so does 
therefore (0| J°(0) |n, 0). Altogether, the contribution of the alternative ground 
states to (6.9) at fixed Q and increasing V is suppressed as 1/./V. Should (6.8) 
be nonzero in the infinite-volume limit, it must be dominated by excitations with 
nonzero momentum. 

We can now switch to infinite volume. This requires changing the normalization 
of the basis states to (m, pln, q) = (271)45mn64(p — q) and replacing the sum over 
p in (6.9) with an integral. Keeping © still finite, the integration over x then amounts 
to a Fourier transform of unity, ie d¢x eiP* = (27) 454, (p). The notation underlines 
that F- (p) is a finite-volume approximation of the Dirac 6-function. We then have 


(0| [Qa(), 8110) = X` J d'p { expl-iEn (p)t154(p) (01 J°(0) In, p) (6.11) 


x (n, p| ® |0) — exp[+i En (p)t18&(—p) (0| © In, p) (n, pl J°(0) 10) } , 


where En (p) = p? is the dispersion relation of the state |n, p). 
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By our assumptions, (6.8) should be nonzero and time-independent in the limit 
Q — oo. For large Q, the function 54 (p) is smooth but sharply peaked around 
p = 0. The time-independence thus requires that only states such that 


lim En(p) =0 (6.12) 
p—0 


contribute to (6.11). The assumption that (6.8) should be nonzero guarantees 
that such states exist. These must be one-particle states, for if the one-particle 
spectrum had a gap, then so would the multiparticle one. We have thus established 
the existence of one-particle states |n, p) with the property (6.12), for which 
both (0| J°(0) |n, p) and (0| ® |n, p) are nonzero. This concludes the proof of 
Goldstone’s theorem. 

There is a simple, intuitive way to understand why NG modes must 
satisfy (6.12). Suppose we know beforehand that there is a one-particle state 
|n, p) created by J? and that the latter descends from a local conservation 
law, ðJ” = 0. Extending the translation property (6.6) to J”, we get at 
once (O| J” (x)|n, p) = exp(—ipn - x) (O| J” (0) |n, p). Upon using current 
conservation and the fact that (0| J°(0) |n, p) is nonzero, it then follows that 
E,(p) = p - (0| J(0) |n, p)/(0| J? (0) |n, p). This shows that the vanishing of the 
energy of NG modes in the long-wavelength limit is an immediate consequence of 
symmetry. The nontrivial part of Goldstone’s theorem is that such NG states exist 
at all. 


6.2.2 Effective Action Proof 


I will now present an entirely different proof of Goldstone’s theorem which will 
prepare the ground for the discussion of redundancies among NG fields in the next 
section. In spirit, this proof is not based on current conservation but rather directly 
on the symmetry of the action. I will follow closely the appendix of [7]. 

The starting assumption is the existence of an action I’ as a functional of a set 
of fields y'. The notation indicates that I is the quantum effective action, since 
we want to make exact statements about a quantum field theory. The argument 
below applies in principle equally well to the classical action S. However, the fields 
yi need not be the “elementary” fields that appear in S. They may be composite 
operators, and are chosen so that the VEV (y‘) is a suitable order parameter for 
SSB. The action is assumed to be invariant under the infinitesimal transformation 


by! (x) = eF'[y, x](a) . (6.13) 
Unlike in the previous, operator proof of Goldstone’s theorem, the symmetry 


transformation, and thus the Noether current, is allowed to depend explicitly on 
coordinates. 
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It is not guaranteed a priori that the quantum effective action I" inherits 
verbatim all the symmetries of the classical action S. It turns out that the 
symmetry transformations on the classical and quantum actions are the same 
at least if F'[y, x] is linear in the fields; see Sect. 16.4 of [8]. This is 
something I assumed already back in Sect.5.2 when I introduced the very 
concept of order parameter. The only restriction involved in the transition to 
the quantum effective action therefore is that the symmetry should preserve 
the functional integral measure. This is needed to exclude quantum anomalies, 
whose presence would invalidate Goldstone’s theorem. 


The invariance of the action is encoded in the condition 


fe oP Fily, yl) =0. (6.14) 
~ byl (y) oe 


Taking one more variational derivative and setting the fields equal to their VEVs, 
(Yi (x)) = g' (x), one gets 


Filg, yl(y) =0. (6.15) 


[> sr 
P OSIO) yay 


Now we need the assumption of continuous translation invariance of the action and 
of the ground state. The second variational derivative of can thus be traded for 
its Fourier transform, which is the inverse propagator of the theory, G ly ](p). 
Multiplying (6.15) with e!?** and integrating over x then yields the sendiion 


stoic) f ay eP? F/T, yl(y) =0. (6.16) 


Hence, the propagator G;;[y](p) has a singularity at energy-momentum p“ 
whenever the integral in (6.16) is nonzero. The spectral representation ensures that 
such singularities arise solely from states in the spectrum of the Hamiltonian. 

By the assumption of broken symmetry, F/[y, y], which represents the sym- 
metry transformation of the order parameter, is nonzero. This alone implies that 
the integral in (6.16) must be nonzero for some p”. For coordinate-independent 
symmetries, the integral is proportional to ô? (p). Moreover, for known examples 
of coordinate-dependent symmetries, F/[g, y] is polynomial in coordinates. In such 
cases, integration over y leads to a linear combination of derivatives of 5? (p). One 
way or another, we conclude that SSB implies the existence of a mode whose energy 
vanishes in the limit of zero momentum, as expected. 
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6.3 Classification and Counting 


None of the above two proofs of the Goldstone theorem addresses the question how 
many NG bosons there are. To that end, we have to put in some extra effort. I will 
follow closely the scheme outlined in Sect. 6.1.3. This means that as the first step, 
we need to know, given the symmetry-breaking pattern, how many independent NG 
fields, or order parameter fluctuations, there are. 


6.3.1 Independent Order Parameter Fluctuations 


We add to (6.13) an index A, distinguishing the action of different symmetry 
generators Q4, òy’ (x) = e^ Fi [y, x](x). Recall the view of NG fields as local 
fluctuations of the order parameter. The easiest (albeit not the only; cf. Sect. 4.2.1) 
way to generate such fluctuations is by replacing «4 —> «^(x). Redundancies 
among order parameter fluctuations are then detected by the existence of nonzero 
functions €^ (x) such that 


ef (x) Filg, x](x) =0. (6.17) 


This relation between symmetry transformations implies an analogous dependence 
between the zero modes of the inverse propagator of the quantum theory. Namely, 
trading the coordinate inside €^ (x) for a derivative with respect to the momentum 
variable p”, denoted as V,,, a combination of (6.16) and (6.17) gives 


eA(- iv) f ay el?’ Filg, y\(y) =0. (6.18) 


The conclusion is that the number of independent NG fields equals the dimension of 
the symmetry group G minus the number of linearly independent solutions of (6.17). 
Note that the linear independence is meant in the functional sense: multiplying all 
the e4 (x) by the same function f(x) does not count as a new solution. 


The rule for counting the number of independent NG fields does not have 
an established form in the literature. My formulation is close in spirit 
to [9] where, however, the functions €^ (x) were only allowed to depend on 
coordinates corresponding to unbroken continuous translations. That seems 
too restrictive for instance in case of crystalline solids where only a discrete 
translation invariance survives in the ground state. 

Another condition for redundancy similar to (6.17) was put forward in [10]; 
see also the review [11]. Their criterion is however phrased in terms of the 
vacuum ket-vector and charge density operators. As such, it also includes 


(continued) 
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the redundancy of NG states under canonical conjugation, which I treat 
separately in Sect. 6.3.2. Finally, in [12] a rule counting independent NG fields 
is formulated directly in terms of the zero modes of the inverse propagator. 

In case of broken spacetime symmetries, the number of independent 
would-be NG fields identified with the help of (6.17) turns out to depend 
on the precise choice of order parameter(s). Remarkably, some of these fields 
may couple to gapped states in the spectrum. This subtlety is missed by the 
classical counting rule for NG fields based on (6.17). It is best addressed 
having the full EFT machinery at hand. I will do so in Chap. 13. 


As a basic sanity check, consider spontaneously broken internal symmetries 
in systems with unbroken translation invariance. In that case, F [p, x] does not 
depend on coordinates, either explicitly or implicitly through the order parameter. 
Also, it is an ordinary local function of g', hence the condition (6.17) reduces to 
EA(x)F _ (gy) = 0. The only nontrivial solutions for €^ (x) are then those corre- 
sponding to generators of the unbroken subgroup H. The number of independent 
NG fields therefore equals dim G — dim H. For spontaneously broken internal 
symmetries, there is a one-to-one correspondence between NG fields and broken 
symmetry generators. This is one of the moral lessons we reached in Chap. 3 by 
analyzing a set of specific models. 

Oftentimes, the solutions to (6.17) descend from redundancy among local 
symmetry transformations acting on an arbitrary field configuration. This was 
certainly the case for Example 6.1 and Example 6.2. The condition (6.17) is however 
much weaker, as it only demands redundancy of local symmetry transformations of 
the ground state. Here is a nontrivial example of this latter type. 


Example 6.5 


Helimagnets are magnetic materials in which the alignment axis of spins gradually 
changes with position; see [13] for basic phenomenology. The simplest type of 
helimagnetic state can be modeled by the unit-vector order parameter 


(n(x, t)) = (cos kz, sin kz, 0) , (6.19) 


where k is a positive constant. The local axis of alignment lies in the xy plane but 
rotates around the z axis, forming a helix with the pitch 277/k. 

Suppose for simplicity that the material in which the helical order develops has 
a cubic crystal lattice. This is the case for instance for the inorganic compounds 
MnSi and FeGe; see Sect. 2 of [14] for a more comprehensive list of known 
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helimagnetic materials. Then the low-energy physics of the material exhibits the 
following approximate continuous symmetries (in addition to time translations): 


e Spatial translations (T): n’(x’, t) = n(x, t) with x’ =x +€, € € RÌ. 
e Space-spin rotations (R): n’(x’, t) = Rn(x, t) with x’ = Rx, R € SO(3). 


Unlike in ordinary (anti)ferromagnets, spin rotations cannot be considered 
separately from spatial rotations as a consequence of the spin-orbit coupling. 
This induces the so-called Dzyaloshinskii—Moriya interaction, which is eventually 
responsible for the development of the helical order (6.19). 

In their evolutionary form, the above transformations correspond respectively 
to the functions TF! = —d,n' and RF! = —e,."x°d,n' + etni. Denoting the 
corresponding infinitesimal parameters as €” (x, t) (translation in the r-th direction) 
and œ” (x, t) (rotation around the r-th axis), (6.17) turns into 


E(x, NI n (x, t)) +o" (x, Dfe x 8, (n' (x, t))—e', (ni (x, 1))] =0. (6.20) 
This boils down to the following constraints on the parameter functions, 


ol (x, t) sin kz = w(x, t)coskz , 
(6.21) 
a (x,t) = kle (x,t) + yo! (x, t) — xa (x, t)] . 


Some of the solutions of (6.21) are trivial. First of all, €e! and €? can be chosen 
arbitrarily since they do not appear in (6.21) at all. These represent unbroken 
translations in the x, y directions. Furthermore, there is another unbroken symmetry, 
corresponding to simultaneous translation in the z-direction and rotation around 
the z-axis. This extends to the solution of (6.21) with œ? (x,t) = ke?(x,f) and 
wt? (x,t) = 0. 

In addition to the three trivial solutions that identify unbroken symmetries, 
there is one solution of (6.21) that represents a genuine redundancy of order 
parameter fluctuations. This can be thought of for instance as picking w! arbitrarily 
and adjusting w? and œ? so that (6.21) is satisfied. Altogether, this makes four 
independent solutions of (6.17) in case of the order parameter (6.19). With six 
different symmetry generators, three translations and three rotations, this leaves us 
with mere two independent NG fields. That is a result we could have guessed: the 
unit vector field n(x, t) contains exactly two degrees of freedom. 


This example demonstrates that while possibly tedious, finding the solutions 
to (6.17) as a set of linear equations for €4 (x) is completely straightforward. Even 
though (6.17) does not give an explicit expression for the number of independent 
NG fields, it therefore offers a streamlined algorithmic procedure how to find them. 
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6.3.2 Type-A and Type-B Nambu-Goldstone Bosons 


Now we finally get to the question how many NG bosons there are in the spectrum. 
I will start with the simpler case of spontaneously broken internal symmetry in 
systems with unbroken translation invariance. In this case, there are guaranteed to 
be dim G — dim H NG fields, in a one-to-one correspondence to broken symmetry 
generators. Finding the number of NG modes requires just a mild generalization of 
the argument in Sect. 6.1.2. 

As the first step, one introduces the commutator matrix 


oas =il[Qa, Jpl) (6.22) 


Translation invariance allows to rewrite this as oag = ilimy_-+o0([Qa, Qzp])/V. 
This makes it clear that 94g = 0 whenever Q4 or Qp is unbroken. The nontrivial 
part of the commutator matrix therefore resides in its restriction to broken generators 
Qa.p,..., that is Qap. Moreover, oap is real antisymmetric, and as such can be block- 
diagonalized by an orthogonal change of basis of symmetry generators. With the 
shorthand notation rg = rank ọ, this implies that the part of the effective Lagrangian 
for NG fields zr“, linear in time derivatives, takes the form 


Leip = Q12 OM? +++ + Ory Ary HO | Ag"? +++. (6.23) 


The ellipsis indicates operators with more than one derivative or two NG fields. 

This allows a unique categorization of NG fields. The rg NG fields 7“ with a = 
1,..., Fo are canonically conjugated into rg/2 pairs. Each such pair corresponds to 
one type-B NG boson. Barring accidental cancellations in the spatial gradient part 
of the effective Lagrangian, the dispersion relation of type-B NG bosons is typically 
quadratic in momentum. The remaining NG fields x“ witha = rg+1,..., dim G — 
dim H do not appear in any bilinear operator with a single time derivative. They 
correspond to one type-A NG boson each. As a rule of thumb, free type-A bosons 
are described by bilinear operators in the Lagrangian with two temporal or two 
spatial derivatives. Hence, their dispersion relation is linear in momentum. 

We have arrived at a simple counting rule for the respective numbers of type-A 
and type-B NG bosons [15, 16], 


1 
na = dim G — dim H — ranko , ng = z ranko . (6.24) 


This characterizes the spectrum of NG bosons based solely on the pattern of 
symmetry breaking and a bit of additional information about the ground state, 
encoded in the matrix 94g. The possible values of ọ4pg are strongly restricted by 
the unbroken symmetry. To start with, in systems with unbroken Lorentz invariance, 
terms in the Lagrangian with a single time derivative are forbidden, hence ọ4g = 0. 
This means that na = dim G — dim H and ng = O. All relativistic NG bosons are 
massless particles with a strictly linear dispersion relation. 
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Let us now relax the requirement of Lorentz invariance, but keep the assumption 
that only internal symmetry is spontaneously broken and that the system possesses 
continuous translation invariance. If in addition the internal symmetry group G is 
compact and semisimple, we can write 


(Qc) 


(LQa, Qa) l (6.25) 
V V 


Qas =i lim =- ffp lim 

V->co V=>œ 
where Tia are the structure constants of G. This shows that the VEV of the 
conserved charges themselves can serve as an order parameter for SSB. The set 
of VEVs (Qc) furnishes a vector in the adjoint representation of G. At the same 
time, it must remain unchanged under the action of H. As a consequence, the vector 
(Qc) must correspond to a trivial one-dimensional representation (singlet) of H. 
Finding such singlets in the decomposition of the Lie algebra g of G into irreducible 
multiplets of H is a quick way to isolate candidate order parameters. 

Focusing on singlets of H in the decomposition of the adjoint representation of G 
may not be restrictive enough in case H is small or even trivial. We can however do 
much better than that. Namely, it is always possible to choose the ground state and 
basis of generators Q 4 such that all the charges that have a nonzero VEV mutually 
commute. A detailed proof of this statement is given in the appendix of [7]. Thus, 
to identify conserved charge order parameters, it is sufficient to restrict to a Cartan 
subalgebra of g and find singlets of H therein. By the usual root decomposition of 
Lie algebras, all the generators of G are organized into pairs whose commutator lies 
in the Cartan subalgebra of g. This maps type-B NG bosons uniquely to pairs of 
mutually conjugate roots of g. 


Example 6.6 


Consider the symmetry-breaking pattern SU(n) —> U(n — 1) with integer n > 2. 
The unbroken subgroup H ~ U(n — 1) can be realized explicitly for instance as the 
set of unitary block-diagonal matrices with blocks of sizes 1 and n — 1, respectively. 
The Lie algebra g ~ su(7) contains a unique singlet of H, namely the generator of 
the U(1) factor of H, Q = diag(—n + 1, 1,..., 1). 

The 2n — 2 broken generators can be chosen as matrices with nonzero elements 
located in the first row and column. These span a complex (n — 1)-plet of 
SU(n — 1) C H. Since all the broken generators reside in a single multiplet of H, 
we expect all the corresponding NG bosons to be either type-A or type-B. Which 
of these two options is realized depends on the VEV (Q). If (Q) = 0, we find that 
na = 2n — 2 and ng = Q. This happens in Lorentz-invariant theories, but also for 
instance in antiferromagnets where n = 2. If, on the other hand, (Q) ~ 0 then 
the broken generators contribute pairwise to 94g. As a result, ranko = 2n — 2. 
According to (6.24), we then find na = 0 and ng = n — 1. The special case of 
n = 2 corresponds to ferromagnets. 


TALAIA ANANN 
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ferromagnet antiferromagnet 


Fig. 6.2 Schematic illustration of spin order in canted (anti)ferromagnets. In ferromagnets, the 
individual spins are slightly tilted away from perfect alignment. Antiferromagnets, on the other 
hand, exhibit overall antialignment of spins, tilted so that a nonzero net magnetization arises. In 
both cases, the whole SU(2) group of spin rotations is spontaneously broken 


Example 6.7 


For an example of a system where a type-B NG boson arises from the VEV of a 
broken symmetry generator, consider the symmetry-breaking pattern SU(2) — {e}. 
Here the unbroken subgroup is trivial and any generator may develop a nonzero 
VEV. However, the rank of antisymmetric matrices is always even. The only 
possibilities are therefore ranko = 0 and rankg = 2. In the latter case, we find 
na = npg = l. This is realized in so-called canted (anti)ferromagnets, schematically 
shown in Fig. 6.2. 


So far I have assumed that the internal symmetry group G is compact and 
semisimple so that (6.25) holds. When this assumption is not satisfied, the commuta- 
tor matrix (6.22) may receive contributions from central charges of the Lie algebra 
g. This is the case for instance for the free Schrödinger field. Without going into 
detail, let me just mention that such central extensions are classified by the so-called 
second Lie algebra cohomology of g. See Chap. 6 of [17] for further information. 

How does the counting of NG bosons change for symmetries that are not 
internal, such as spacetime symmetries? In this case, there are no established 
general results in the literature. However, the simplicity of the argument given 
in Sect. 6.1.2 suggests that the reduction of independent NG modes based on the 
charge commutator (6.22) is robust. Suppose that we have already identified a set of 
independent NG fields 7“. Let us further assume that we can map these to a set of 
broken generators Q, that act on the fields by a shift such as (6.1). We can restrict 
the definition of the commutator matrix to the generators Qa. The VEV in (6.22) 
may now in principle be coordinate-dependent, and it may no longer be appropriate 
to replace J$ with the spatial average of Qg. However, the rest of the argument 
will still go through. We will then get the same identification of type-B NG bosons 
with pairs of broken generators whose commutator has a nonzero VEV. Nontrivial 
examples of systems where the counting rule (6.24) for type-B NG bosons was still 
found to be valid include the centrally extended algebra of spatial translations in an 


6.4 Nambu-Goldstone-Like Modes 101 


external magnetic field [15], and central extensions of commutators of spatial and 
internal symmetries in presence of topological defects such as a domain wall [18] 
or a line defect [19]. 


6.4 Nambu-Goldstone-Like Modes 


The defining property of a NG boson is its relation to symmetry and its sponta- 
neous breaking. In the operator proof of Goldstone’s theorem (Sect. 6.2.1), this 
manifests through the coupling of the NG state to the broken symmetry current, 
(0| J°(0) |n, p). In the effective action proof (Sect. 6.2.2), the NG boson appears 
through a flat direction of the effective potential or action. Many physical systems 
possess excitations with the above properties that however lack the main attribute 
of a NG boson, that is being exactly gapless. Due to their similarity to genuine NG 
modes, I will call such excitations NG-like bosons. 

The existence of SSB-related modes that are not exactly gapless is of course 
only possible if some of the assumptions of Goldstone’s theorem are violated. The 
usual suspect is the symmetry. There are very few exact symmetries in nature; such 
usually only exist in our models, valid to certain, possibly high, accuracy. In this 
last section of the chapter, I will give a brief survey of different mechanisms how a 
NG-like boson may arise from a symmetry that is only approximate. 


6.4.1 Pseudo-Nambu-Goldstone Bosons 


Consider a theory whose Lagrangian can be split into a part invariant under a group 
G, and a small perturbation invariant only under some proper subgroup of G, 


L = Lin + €Lpert . (6.26) 


This is a setup that I used previously to isolate a unique ground state. The difference 
is that now I want to keep the parameter € small but nonzero. This is called explicit 
symmetry breaking. 

Suppose that in the limit € — 0, the symmetry under G is broken spontaneously. 
It is sensible to expect that by restoring small but nonzero e, the spectrum will evolve 
adiabatically. The NG mode predicted by Goldstone’s theorem will then survive, but 
its mass (gap) will no longer be exactly zero. Such “approximate” NG bosons are 
usually called pseudo-NG bosons. 

Let us try to make an educated guess at how the gap of a pseudo-NG boson 
depends parametrically on e. Imagine that we have already derived a low-energy 
EFT for the (pseudo-)NG bosons in our system. The contributions of the perturba- 
tion Lert to the effective Lagrangian can be organized by powers of e. The leading 
contribution to the bilinear part of the effective Lagrangian typically comes from a 
nonderivative operator linear in €. I will justify this claim more carefully in Chap. 8, 
where I actually construct the effective Lagrangian. 
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The rest of the argument depends on the type of the NG boson in the limit of 
vanishing perturbation. This type is still defined by the presence or absence of a 
term with a single time derivative, containing the given NG field. For type-A NG 
bosons, the bilinear Lagrangian contains two time derivatives which translate to 
squared energy in the Fourier space. For type-B NG bosons, on the other hand, a 
term with a single time derivative is present and dominates in the low-energy limit. 
This leads to the following parametric dependence of energy at zero momentum on 
E, 


lim E(p) x Je (“type-A” pseudo-NG boson) , 
p> 


(6.27) 
lim E(p)xe (“type-B” pseudo-NG boson) . 
p> 


I have just used quotation marks for a reason. There is no established, 
unambiguous classification of pseudo-NG bosons. All I did was to rely on 
the continuity of the spectrum in the limit € — 0. Moreover, the parametric 
scaling in (6.27) is based on the assumption of a rather specific form of 
perturbation in the effective Lagrangian. It is imaginable that even for an 
originally type-A NG boson, -Zper will induce an operator in the effective 
Lagrangian with a single time derivative and a coefficient proportional to 
some power of e. That may change the way the gap of the pseudo-NG mode 
scales with e. This is exactly what happens for the “massive” NG bosons, 
introduced in Sect. 6.4.3. 


It is a simple application of the discussion in Sect. 6.3.2 to find how many pseudo- 
NG bosons one should expect in a given system. I will denote as Ge and He the 
symmetries of the action and of the ground state in presence of the perturbation. In 
analogy with (6.22), I will define the commutator matrix Qe by 


bean = illO, Tees (6.28) 


The e in the subscripts reminds us that the indices A, B run over the generators of 
Ge, and that the VEV is taken in the perturbed vacuum. According to (6.24), there 
are altogether dim G — dim H — (1/2) rank ọ NG modes in the limit € — 0. By the 
same formula, dim Ge — dim He — (1/2) rank 9, genuine NG bosons survive upon 
switching on the perturbation. On the assumption of continuity of the spectrum, the 
number of pseudo-NG bosons therefore equals 


1 
NpseudoNG = (dim G — dim Ge) — (dim H — dim He) — 3 frank @ — rank oe) . 
(6.29) 
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We saw an example of a pseudo-NG boson already in Chap. 2. In that case, G ~ 
Ud)v x UC)a and H œ~ U(1)y, whereas Ge ~ He ~ U(l)y. The counting 
rule (6.29) is trivially satisfied and the prediction (6.27) for the scaling of the gap 
agrees with our explicit calculation of the mass, cf. (2.12). Next, let us have a look 
at a couple of less trivial examples. 


We already met quantum chromodynamics (QCD) in Chap. 5, see Example 5.5. 
In this case, the quark mass plays the role of the perturbation e. With n¢ flavors 
of massless quarks, QCD possesses a G ~ SU(nfL x SU(nf)r symmetry, 
spontaneously broken down to the “vector” subgroup H ~ SU(n¢)v. In presence 
of the perturbation (with equal masses of all quarks), this reduces to Ge ~ He ~ 
SU(n¢)y. The vacuum of QCD is Lorentz-invariant, so the matrices ọ and Q, are 
both identically zero. According to the counting rule (6.29), this leaves us with 
ne — 1 pseudo-NG bosons, which can be identified with a multiplet of pseudoscalar 
mesons. The fact that their mass scales with the square root of the quark mass, as 
predicted by (6.27), has been verified in numerical lattice simulations of QCD. 


As discussed before, ideal (isotropic) antiferromagnets possess a G ~ SU(2) spin 
symmetry, spontaneously broken down to H ~ U(1). The low-energy EFT for 
antiferromagnetic spin waves (magnons) is a nonrelativistic version of the nonlinear 
sigma model, introduced in Example 3.3, 


Pex 5,3 (51 jaon! don! — vs; Vni - Vn’) + en}. (6.30) 
Here os is the so-called spin stiffness, n a unit-vector variable encoding the two NG 
fields, and v the magnon phase velocity in the symmetric limit € = 0. 

For € > 0, the perturbation proportional to (n*)? describes a crystal anisotropy, 
corresponding to “easy-axis” antiferromagnets; see [20] for a mild introduction to 
antiferromagnetic magnons. The perturbation breaks the continuous spin symmetry 
explicitly down to Ge ~ U(1), consisting of rotations around the third spin axis. At 
the same time, it leaves only two possible ground states, (n)e = (0, 0, +1), both of 
which preserve He ~ U(1). By the counting formula (6.29), both magnons become 
pseudo-NG bosons. Their gap is proportional to „/€ in accord with (6.27). 
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6.4.2 Quasi-Nambu-Goldstone Bosons 


The above general discussion of pseudo-NG bosons applies in principle to any 
NG-like mode, associated with an approximate symmetry. However, the strength 
of the perturbation, and hence the size of the gap in the spectrum, may depend on 
the concrete mechanism of explicit symmetry breaking. It is in particular possible 
to have systems with a well-defined classical limit whose symmetry is explicitly 
broken only by quantum corrections. The ensuing pseudo-NG modes have been 
dubbed guasi-NG bosons [21]. I refer the reader to [22,23] for a detailed discussion 
of the spectrum of quasi-NG modes. 

To be more concrete, suppose we are given a Lorentz-invariant Lagrangian whose 
potential has a higher symmetry than the kinetic term. At the classical level, the 
spectrum of NG bosons is determined by the zero modes of the Hessian of the 
potential in the ground state. But some of these zero modes may correspond to 
symmetries of the potential that do not preserve the kinetic term. The corresponding 
fluctuations of the order parameter will receive a mass generated by quantum 
corrections. 

The lifting of the mass of the would-be NG modes does not have to be induced 
by the kinetic term. What matters is that the classical ground state is determined 
entirely by a part of the Lagrangian that has a higher symmetry than the whole. The 
reduction of the symmetry may also be caused for instance by other contributions 
to the classical potential, or by a coupling to gauge fields. 


6.4.3 Massive Nambu-Goldstone Bosons 


What the pseudo-NG bosons and their subclass, quasi-NG bosons, have in common 
is that their gap can rarely be determined from first principles. In general, it depends 
on all parameters a given theory might have. One says that the gap is not protected by 
symmetry, in contrast to genuine NG bosons whose gap is guaranteed to vanish by 
SSB. There is however a remarkable special class of pseudo-NG bosons whose gap 
can be calculated exactly using symmetry alone. Their existence was first pointed 
out about a decade ago [24] and I will call them massive NG bosons following [25]. 

Consider a system with a “microscopic” Hamiltonian H, possessing an internal 
symmetry group G. Pick one generator, Q, of G. In statistical physics, the many- 
body ground state of the system with a fixed average value of Q is determined 
by minimizing the grandcanonical Hamiltonian H, = H — uQ. Here m is the 
corresponding chemical potential, which is fixed by external conditions imposed on 
the system. If needed, one can always shift H, by a constant so that the many-body 
ground state |0),, satisfies H, |0), = 0. In this ground state, the symmetry under 
G may be spontaneously broken. We are particularly interested in the fate of those 
generators of G that are spontaneously broken but do not commute with Q. 

As long as G is compact, we can always choose Q to be part of a Cartan 
subalgebra of the Lie algebra g of G. Those generators of G that do not commute 
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with Q then organize themselves into pairs Q+ such that Q- = Qi and 


[Q, Q+] = q Q+. (6.31) 


The factor q is determined by the corresponding root vector, and is thus fixed by 
symmetry. We can always choose the generators so that both u and q are positive. I 
will now rerun the operator proof of Goldstone’s theorem detailed in Sect. 6.2.1 on 
the VEV „(0lLQ+2(t), J? (0)] 10) ,. 

Of course, (6.31) is only formal since we know that generators of spontaneously 
broken symmetry are ill-defined as operators on the Hilbert space. I therefore 
assume that (6.31) also holds for the finite-volume charges Q+Q, possibly up to 
a correction that vanishes in the limit Q —> oo. This can be ensured for instance if 
the charge densities satisfy the local commutation relation [Q, Je (x)] = +q J (x). 
One can then define a related pair of charges via “time evolution” with respect to 
Ay, 


¥OiQ(t) = f d% exp(i,,t — iP - x) J? (0) exp(—iH,t + iP - x) 
Q 


(6.32) 


= MO Oyo (etH = THI Ool). 


The point of this redefinition is that the spectrum of the many-body system consists 
of eigenstates of H, rather than H. I will keep the notation E, (p) for the eigenvalue 
of H, (excitation energy) in the eigenstate |n, p). Following the same steps as in 
Sect. 6.2.1 then leads to an analog of (6.11), 


„OILO, J? 0) 10), = e% (OIF O+.0(¢), J? 0] 10), 


2 
| (6.33) 


= ela I d'p [ovi-iz onto (01/2) I. p) 
l è d 0 
— expl-+iEn(p)#154(—p) |, (01/20) In, p)| | f 


With positive u and q, the assumed time independence of VEVs of commutators 
of Q.¢(t) in the Q — oo limit guarantees that limp+9 ,,(0|J2 (0) |n, p) = O for 
any eigenstate |n, p). Assuming that the VEV we started with is nonzero, on the 
other hand, ensures the existence of a state for which ,, (0| J? (0) |n, p) 4 0 and 


lim En (p) = uq . (6.34) 
p>0 


This is our massive NG boson. With some extra effort, one can also derive a counting 
rule for the number of massive NG modes in a system. To formulate such a rule, 
we adapt (6.22) for the present problem by introducing 04,48 = i([Qa, J?) u and 
On,AB =i((Qa, JP) u- The matrix @ is constructed using the generators of the 
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subgroup of G that commutes with Q. The number of massive NG modes then 
is 


1 z 
NmassiveNG = 5 (rank Qu — rank Q,,) . (6.35) 


See [25] for a detailed proof. 

The counting rule (6.35) may resemble the last term in (6.29) that counts 
pseudo-NG bosons. It may thus be worthwhile to stress the difference. Namely, the 
matrices @ and ọe in (6.29) are evaluated in different ground states, corresponding 
respectively to the absence and presence of the perturbation. On the contrary, both 
matrices Q,, and @, in (6.35) are evaluated in the same ground state, “perturbed” 
by the presence of the chemical potential. The difference is best illustrated on an 
example. 


Example 6.10 


Let us have one more look at antiferromagnets, this time assuming perfect isotropy 
and spin symmetry, that is G ~ SU(2) and H ~ U(1). In an external magnetic 
field B, the Hamiltonian of the antiferromagnet receives a contribution —WmB - S, 
where um is the magnetic moment and S the operator of total spin. The combination 
[Lm |B| plays the role of a chemical potential for the projection of S into the direction 
of B. The latter can without loss of generality be chosen as, say, $3. 

Let us first treat the effect of the magnetic field as a perturbation in the sense 
of Sect.6.4.1. The field breaks the spin symmetry explicitly down to Ge = 
U(1). This necessarily implies rank ọe = 0. Somewhat against the intuition, the 
antiferromagnet responds to the magnetic field by aligning its spins largely in a 
direction perpendicular to B. This ground state breaks the residual exact symmetry 
so that He ~ {e}. At the same time, in the unperturbed antiferromagnetic ground 
state, there is no net magnetization so that rank ọ = 0. Equations (6.22) and (6.29) 
then tell us that there is one genuine, type-A NG boson and one pseudo-NG boson. 
Unlike in the case of an easy-axis anisotropy, one of the two magnons therefore 
remains gapless. 

The fact that the magnetic field acts as a chemical potential allows us to make 
a stronger statement about the spectrum. The magnetic field does induce net mag- 
netization in the perturbed ground state, given by nonzero ($3), = —i([S1, S2]),- 
This implies rank @,, = 2. At the same time, rank @,, = O simply because there 
are not enough symmetry generators left intact by the magnetic field. In accord 
with (6.35), we then expect one massive NG mode in the spectrum. In other words, 
the previously predicted pseudo-NG magnon has a gap exactly fixed by (6.34). 
The algebraic factor g equals one thanks to the commutator of $3 with the ladder 
operators S+ = S1 + 152, [S3, St] = +S 4. Hence the gap of the magnon equals 
Lm |B]. 
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An interested reader will find further examples of massive NG bosons in [25]. 
Let me just stress that the exact expression (6.34) for the gap is not the only 
special property of massive NG bosons. The peculiar perturbation by a chemical 
potential coupled to a conserved charge leaves us with a set of exact, albeit modified, 
conservation laws. These in turn impose exact constraints on the interactions of 
massive NG bosons. As a result, the Adler zero principle (cf. Sect. 2.2) remains valid 
for massive NG modes, although it is generally violated for pseudo-NG bosons [26]. 

As an aside, note that not every NG boson that acquires a gap due to a chemical 
potential is a massive NG boson [27]. The simplest example of such an exception 
is provided by choosing G ~ SU(2) and breaking it completely in the ground state. 
Provided none of the three generators develops a VEV, the spectrum contains three 
type-A NG bosons. Upon adding a chemical potential u for one of the generators, 
two of the NG modes receive a gap, proportional to u. Only one of these is however 
a massive NG boson, as is easily checked using (6.35). 
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Part Ill 


Spontaneously Broken Internal Symmetry 


| Check for 
updates 


In Parts I and II of the book, I reviewed the basic physics of spontaneous symmetry 
breaking (SSB) and the corresponding Nambu—Goldstone (NG) bosons. In order to 
keep the discussion simple, I relied mostly on general but elementary field-theoretic 
arguments. Where explicit results were necessary or useful, I resorted to simple 
models. However, one of the main virtues of SSB is universality: the low-energy 
physics is largely independent of the microscopic, short-distance details. This agrees 
with the general spirit of effective field theory (EFT); cf. Sect. 1.1. What SSB does 
for us is ensure a separation of resolution scales, which makes it possible to build a 
low-energy EFT solely using the NG degrees of freedom. 

I will now set out on the main quest of this book: to develop an EFT framework 
for SSB that is based on symmetry alone. Thanks to the universality of SSB, the 
predictions of such a framework are guaranteed to match those of any microscopic 
theory with the same symmetry-breaking pattern. The construction of the EFT 
requires two main steps. The first step is to realize the action of the symmetry solely 
in terms of a set of NG fields. This problem is dealt with in the present chapter. The 
results will be used in the next chapter to construct effective actions for NG bosons. 

The celebrated Wigner theorem of quantum mechanics states that any symmetry 
of a quantum system can be realized by a linear unitary or antilinear antiunitary 
operator on its Hilbert space (see Chap. 2 of [1] for a detailed discussion). As a 
consequence, states in the spectrum of a quantum system can be organized into 
multiplets, corresponding to irreducible representations of its symmetry group. In 
quantum field theory, the concept of linear realization of symmetry is naturally 
lifted from states to fields. In a perturbative setting, one usually has a one-to-one 
correspondence between one-particle states and (elementary) fields. Moreover, the 
fields span a linear space. One then expects the symmetry to act on fields linearly 
via some representation just like it does on states in the Hilbert space. 


© The Author(s) 2024 111 
T. Brauner, Effective Field Theory for Spontaneously Broken Symmetry, Lecture 
Notes in Physics 1023, https://doi.org/10.1007/978-3-03 1-48378-3_7 


112 7 Nonlinear Realization of Symmetry 


A field theory of a single complex scalar field @ may possess a U(1) symmetry under 
which the field transforms as ¢ — ed, where € is a real parameter. In this case, 
the field @ belongs to a complex one-dimensional representation of U(1). Similarly, 
the SO(n) “linear sigma model” includes an n-plet of real scalar fields ¢', subject to 
symmetry transformations øt —> R!.¢/, where R € SO(n). In this case, the fields 
¢' belong to the vector representation of SO(n). 


As we saw in Sect.5.3, however, a symmetry that is spontaneously broken is 
not necessarily realized by a set of unitary operators on the Hilbert space of states. 
Likewise, as illustrated in Chap. 2, it may be convenient to use field variables that 
do not belong to a linear representation of the symmetry. This may even become a 
necessity in the low-energy EFT where only the NG bosons of the broken symmetry 
are present; we simply do not have enough degrees of freedom to fill complete 
multiplets of the symmetry group. 


The complex scalar field of Example 7.1 can be represented by its real and imaginary 
parts, ¢ = ¢! + i¢?. These span the vector representation of SO(2) ~ U(1). When 
the U(1) symmetry is spontaneously broken, it may however be more convenient to 
use the exponential parameterization of the field, ¢ = ge’, in terms of its modulus 
o and phase @. In the low-energy EFT, the modulus field is integrated out and the 
only remaining degree of freedom is the NG field 0. The latter transforms under 
e€ € U(1) as 0 — 6 + €, which is not a linear representation. This is not merely 
a matter of bad choice of parameterization; the EFT contains a single real field, yet 
the group U(1) does not have any nontrivial real one-dimensional representations. 


With the above observations in mind, I will develop in this chapter a formalism 
for nonlinear realization of symmetries. Mathematically, this amounts to generaliz- 
ing the concept of a linear representation of a symmetry group to that of an action 
of the group. The space on which the group acts need not be linear itself; we can 
think of it as some manifold. Section 7.1 introduces the necessary mathematical 
terminology. The main argument, leading to a classification of possible nonlinear 
realizations of symmetry, is presented in Sect.7.2. A reader interested mainly in 
ready-made results of the formalism may want to proceed directly to Sect. 7.3; 
this collects a number of practically useful formulas that I will refer to in the 
following chapters. Finally, Sect.7.4 offers an alternative geometric viewpoint 
which illuminates some of the mathematical structure used to realize a symmetry 
nonlinearly. While most of the chapter is phrased in a rather elementary language, 
this last section relies on some concepts of differential geometry in an extent covered 
in Appendix A. 
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7.1 Group Action on Manifolds 


To motivate the mathematical language that I need to introduce, suppose that we 
are given a theory of a set of fields, y', that possesses a symmetry group, G.! We 
would like to understand how to realize the symmetry group in terms of a set of 
transformations of the fields, 


Tr:viov'=F(y,9), g€G, (7.1) 


where the functions F! are assumed to be smooth in both of their arguments. The 
set of transformations 7, is constrained by the requirement that it respects the group 
structure of G. Thus, the unit element e € G must be represented by the identity 
map id, F! (y, e) = w. Consistency with group multiplication requires that 


F (y, gig2) = F (FW, g2). 81), 82EG. (1.2) 


Finally, the transformation induced by the inverse of an element g € G has to satisfy 
FE, 3).g J=w', geG. (7.3) 


In the terminology introduced in Chap.4, (7.1) is an example of a point 
transformation [2,3]. The class of point transformations is clearly much broader 
than that of mere linear transformations, induced by a representation of G on 
the fields. It is therefore worthwhile to recall that even point transformations do 
not exhaust all conceivable, and physically relevant, realizations of symmetry. 
First, the field transformation may in principle depend explicitly on the spacetime 
coordinates. This feature can be included under the umbrella of point symmetries by 
treating fields and coordinates on the same footing; this will become relevant later 
when we talk about spacetime symmetries. Moreover, it is perfectly possible that 
the transformation of the fields also depends on their derivatives; this was dubbed 
generalized local transformation in Sect.4.1. Such generalized symmetries play a 
minor role in this book, yet we will see some concrete examples in Chap. 10. 

The restriction to point symmetries of the type (7.1), which I will make from now 
on unless explicitly stated otherwise, is a matter of practical convenience. Namely, 
it will allow us to disregard the fact that yy‘ actually are fields, that is functions 
of spacetime coordinates. Instead, I will treat them as independent variables that 
the group G acts upon. With this important technical assumption, we can now 
reformulate the problem of finding all possible realizations of a given symmetry 
group G on the fields y’ in geometric terms. 


' Many of the concepts introduced below can be applied without change to any, even finite, group. 
However, I will always have implicitly in mind a connected Lie group, or the component of a Lie 
group connected to the unit element. 
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Consider a manifold M such that each point x € M is uniquely specified by 
a set of values yÝ. We can think of y’ as a set of (possibly only locally defined) 
coordinates on M. An action of the group G on M is a set of smooth invertible 
maps T, : M —> M that satisfy the group constraints 


Te=id, Teg =TgoTg,  Tea=(T) t, Sen ereG. 
(7.4) 


In somewhat more abstract terms, the action of the group on the manifold is defined 
by a homomorphism from G to the group of diffeomorphisms on M. 


It is important to distinguish actual symmetry transformations on M from 
a mere change of coordinates. The action of the group G on M is defined 
geometrically by the maps T, without reference to a particular set of 
coordinates yr’. Depending on the choice of coordinates, the same map T, 
may correspond to different functions F’ as defined by (7.1). The freedom 
to choose coordinates on M mirrors the freedom to choose field variables 
in a given field theory. On the one hand, the independence of geometric 
properties of manifolds on the choice of local coordinates is a cornerstone 
of the language of differential geometry. On the other hand, it is an important 
result of quantum field theory that physical observables such as the S-matrix 
are invariant under (nearly) arbitrary field redefinitions [4,5]. See [6] for a 
recent pedagogical discussion of this issue. 


Let us now introduce some further terminology. For a given action of the group 
G on the manifold M, the orbit of a point x € M is the set of all points on M that 
can be reached from x by the action of some group element, 


O; = {T,x |g E€ G}. (7.5) 


The relation y ~ x if and only if there is a g € G such that y = Tx is an 
equivalence. Orbits of the group G on the manifold M are the equivalence classes 
of this relation. As a consequence, M is a disjoint union of a (possibly infinite) set 
of orbits. 

For a given point x € M, one defines its isotropy group (also called the stabilizer 
or the little group of x) as the subgroup of G that maps x to itself, 


Hy ={heG|Thx =x}. (7.6) 


Two points lying on the same orbit of G have isomorphic isotropy groups. Indeed, 
if y = T,x, then for any h € Hy we have Tong = 1,Th T,-1y = 1,T,x = Tx = 


y. Conversely, it is easy to check that Tpy = y implies g~'hg € H,. Thus, Hy and 
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H; are conjugate as subgroups, 


Hr,x = gHyg'. (7.7) 


The rotation group SO(2) acts on the Euclidean plane by rotations around the origin, 
see Fig. 7.1 for an illustration. All points of the plane away from the origin have a 
trivial isotropy group. The corresponding orbits of SO(2) are circles centered at the 
origin. The origin itself forms an orbit with the isotropy group SO(2). 

This example has a straightforward generalization to the action of SO(n) on the 
Euclidean space R”. There, the origin has the isotropy group SO(n). All other points 
x € R” have the isotropy group SO(n — 1), corresponding to (n — 1)-dimensional 
rotations that leave the line connecting x to the origin fixed. The corresponding 
orbits are (n — 1)-dimensional spheres centered at the origin. 


In somewhat loose terms, we can say that all points lying on the same orbit of 
G on the manifold M have “the same” properties, since they can be related by a 
symmetry transformation. We thus expect manifolds consisting of a single orbit to 
have particularly simple geometric properties. In this case, the action of the group 
G on the manifold M is called transitive; any point on the manifold can be reached 
from any other point by the action of a suitable group element. A manifold equipped 
with a transitive action of a group is referred to as a homogeneous space. 

Homogeneous spaces will play a central role throughout the rest of this book. 
One of the special properties of a homogeneous space is that its structure is 
completely determined by the group G and its subgroup H that specifies the isotropy 
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group of the homogeneous space.” To see why, let us introduce a relation between 
two elements of G: g2 ~ gı if and only if there is an h € H such that g2 = gih. 
This is an equivalence relation. As a consequence, the group G is partitioned into a 
disjoint union of the corresponding equivalence classes. The equivalence class 


gH = {gh|h € H}, geG, (7.8) 


is called the left coset (or simply coset) of H in G. Incidentally, the cosets 
themselves can be viewed as orbits of a group action if we treat the group G as 
the manifold on which the subgroup H acts by multiplication from the right. 

The quotient set G/H, called the coset space, is the set of all cosets of H in G. 
Despite the different appearance, this is just another mathematical realization of a 
homogeneous space. Indeed, consider a homogeneous space M along with a single 
point x € M and its isotropy group H. It is easy to see that for any two elements 
81, 92 € G, Tg, x = Tg, x if and only if g1 ~ go, that is if gı and g2 belong to the 
same coset. Hence there is a one-to-one correspondence between the elements of 
the coset space G/H and the points of the manifold M. 


The Euclidean group ISO(n) consists of proper rotations and translations in R” 
and their combinations. It obviously acts transitively on R” for the simple reason 
that any point in R” can be reached from any other point by a suitable translation. 
Each point in R” has a its stabilizer a particular subgroup of ISO(7), consisting of 
SO(n) rotations around that point. Hence R” equipped with the action of ISO(n) is a 
homogeneous space, equivalent to the coset space ISO(n)/SO(n). This view of the 
Euclidean space in terms of its symmetry group follows the “Erlangen program,” 
put forward by Felix Klein in 1872. 


The rotation group SO(n + 1) acts naturally on the n-dimensional unit sphere S” if 
one thinks of the latter as embedded in R”+!. We can however think of the sphere S” 
in itself as our manifold M. In that case, the action of SO(n + 1) becomes transitive; 
any point on the sphere can be reached from any other point by a suitable rotation. 
The isotropy group of any point on the sphere is a particular SO(n) subgroup of 
SO(n + 1). Hence, the sphere $” equipped with the action of SO(n + 1) is a 
homogeneous space, equivalent to the coset space SO(n + 1)/SO(n). 


? Since the isotropy group is the same up to conjugation for all points on the same orbit, and hence 
all points of the homogeneous space, we can drop the subscript x. 
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Having set up the necessary mathematical background, I can now give a concise 
formulation of the main goal of this chapter. In order to understand how a given 
symmetry group G can act on a given set of fields, we need to classify all possible 
actions of G on a given manifold M. This will be accomplished in the following 
section. A reader seeking further details on the mathematical background reviewed 
above is recommended to consult Chap. 13 of [7]. 


7.2 Classification of Nonlinear Realizations 


At first sight, the task to classify all actions of an arbitrary group G on an arbitrary 
manifold M seems hopeless. We would only know what to do in the case of a 
linear action on a vector space, where the problem boils down to the good old 
representation theory of Lie groups. How can we make use of this? 

In this section, I will answer the above question, following the classic work of 
Coleman, Wess and Zumino [8]. The key step is to observe that the action of a group 
on a manifold can be partially linearized by a suitable choice of coordinates on the 
manifold. Let us choose a fixed point x9 € M. We can always introduce a set of 
coordinates y’ in the neighborhood of xo such that x9 maps to the origin, yi = 0. 
Now any linear transformation leaves the origin intact. On the contrary, only the 
isotropy group H,, keeps xo fixed; all the other elements of G translate xo to some 
other point on the manifold. Thus, the best we can hope for is that we find a set of 
coordinates in which the action of Hx, not of the whole group G, becomes linear. 

We will see that provided the isotropy group Hx is compact, one can indeed 
construct a set of local coordinates on M in the neighborhood of xo in which the 
action of Hyo is linear. The good news is that this is in fact sufficient to classify 
all nonlinear realizations of the whole group G. The details of the argument are the 
subject of the following two subsections. 

While this whole chapter is intended to develop a mathematical formalism that 
will prove invaluable later, it might be helpful for the reader already now to keep in 
mind the corresponding concepts pertinent to SSB, introduced in Chap. 5. The fixed 
point x9 € M corresponds to the selected vacuum state, or the associated order 
parameter. The subgroup Hyo that leaves xg intact is analogous to the subgroup 
of unbroken symmetries. Finally, the coset space G/H,,, which will turn out to 
span a submanifold of M, is analogous to the vacuum manifold. For the reader’s 
convenience, this correspondence is summarized in Table 7.1. 


Table 7.1 Correspondence between the mathematical terminology used in this chapter and the 
physics terminology introduced in Chap. 5 


Mathematics terminology Physics terminology 
“Origin” x9 € M Vacuum state (order parameter) 
Isotropy group Hxo Unbroken subgroup H 


Coset space G/ HA, Vacuum manifold 
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7.2.1 Linearization of Group Action 


To simplify the notation, I will from now on denote the functions F’ defined in (7.1) 
directly as Y” whenever possible. Suppose that we have a set of local coordinates wy! 
on M in which the chosen point x9 maps to zero. In these coordinates, the isotropy 
group H,, will be represented by some nonlinear functions of y? whose Taylor 
expansion in vi starts at the linear order, 


y“, h) = Dh yi + OW) , (7.9) 


where h € Hy, and D(h)! j is a set of matrix coefficients. The conditions (7.2) 
and (7.3) require that the matrices D (h) form a representation of H, xo- We can then 
change the coordinates in the vicinity of xo to 


Yi (y) =f dh Dh!) wh), (7.10) 


x0 


where dh is an invariant measure on Hy normalized so that the total volume of Hy, 
is one.* The normalization ensures that near the origin, Yİ = wy! + O(w7). Hence, 
Wi are a well-defined set of coordinates in some neighborhood of the origin. In these 
new coordinates, the action of the isotropy group is linear. This can be seen upon a 
short manipulation, 


viwmn= f ah DOD PEW D= | dh D(h™'Y F} (4, hh’) 
H, ` H, 


x0 XO 


= po, f dah’) [DARN y h, hh’) = DW W , (7.11) 


Ho 


for any h’ € Hyg, where I used respectively the group composition law (7.2), the 
invariance of the integration measure, the representation property of the matrices 
D(h), and the definition (7.10) of the new coordinates. This concludes the proof 
of the statement that the action of the isotropy group Hyo can be linearized by a 
suitable choice of coordinates. 


In defining the new coordinates (7.10), I tacitly assumed that H,, has a finite 
volume so that the invariant measure on H,, in fact can be normalized to unity. 
This is a key step in the proof, which requires that the isotropy group H,, be 


(continued) 


3 A reader not familiar with group integration may find more information in Chap. 3 of [9]. I will 
not dwell on details, since we shall not need group integration again in this book. 
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compact. The technique of nonlinear realization of symmetry developed in 
this chapter is often introduced straight away under the assumption that the 
whole group G is compact. That is, however, not necessary. 


Equation (7.10) can serve as a useful practical tool to find explicitly the 
coordinates that linearize the action of the isotropy group. Let us have a look at 
a simple example, following [10]. 


Example 7.6 


The action of SO(2) on the Euclidean plane, introduced in Example 7.3, can be 
recast in terms of an action of U(1) ~ SO(2) on the complex plane C. Thus, under 
e€ € U(1), the complex coordinate z transforms as z > z’ = e€z. This action 
is linear. One can however tweak it by changing the coordinate z to w such that 
z = f(w), where f is a function analytic in the neighborhood of the origin of C 


such that f(w) = w+ O(w7). In the new coordinate w, the U(1) group acts via 
wow = fT) = fle fw)). (7.12) 


In general, w’ will be a nonlinear function of w. The original coordinate z, in which 
the action of U(1) is linear, can be reconstructed using (7.10). In case of U(1), 
the group integration in (7.10) amounts to averaging over the phase e of the U(1) 
rotation. It can be written as an integral over a unit circle in the complex plane, 


I care ee i de 
5 f dee “f (e* f(w)) = — f zf (cfw), (7.13) 
m Jo 2a jJ c 


where I introduced a new complex integration variable c = et“. Given the 
assumptions I made on f, the function f~!(cf(w))/c* of the complex variable 
c has a simple pole at the origin with the residue f (w). It then follows at once from 
the residue theorem that the integral in (7.13) evaluates to f(w) = z, as expected. 

Note that the coordinate z is not uniquely specified by the requirement that the 
action of U(1) is linear. We can for instance introduce a new variable w via 


w = zf (zz), (7.14) 
where f is a smooth real function such that near the origin, f(zz) = 1 + O(zz). 


Then w is a well-defined coordinate in some neighborhood of the origin of C, upon 
which e® € U(1) acts linearly as w > ew. 
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The above example suggests that the local coordinates W' in which the isotropy 
group Hx, acts linearly are generally ambiguous. Namely, under the action of Hyg, 
the manifold M splits into a disjoint union of orbits. The action of H,, will remain 
linear if we rescale the coordinates Y’ by an arbitrary H vo-invariant function on 
M, that is a function which takes a constant value on any orbit of Hx}. The only 
constraint is that such a rescaling leads to a well-defined set of new coordinates. 
The following example provides a nontrivial illustration of this ambiguity. 


Example 7.7 


Consider the action of G ~ SU(2) x SU(2) on M ~ SU(2). As already hinted 
in Example 5.5 and explained in detail in Sect.9.1, this is important for a low- 
energy EFT description of hadron physics. For a given U € M and a given element 
(gL, gR) € G, the action is defined by 


UuU— ele. ; (7.15) 


The isotropy group of Uo = 1 is the “diagonal” subgroup of G, Haj, ~ SU(2), 
consisting of elements of the type (g, g), that is gL = gr = g. It is easy to guess 
a triplet of coordinates y’, parameterizing M in the vicinity of Uo, on which Hyp 
acts linearly. Some common choices are 


. Ses 
U=c¥t | eae m, U=1/1-yW2+iv-t, (7.16) 


where t is the vector of Pauli matrices. All these choices coincide to linear order 
when expanded in powers of Yi, U = 1+ip-1t +O(y7). All of them are mutually 
connected by coordinate redefinitions of the type Y” = y’ f(w7), where f is a 
suitably chosen function. The isotropy group Hq, acts on YH in all parameterizations 
shown in (7.16) via rotations of y. Hence yw is invariant under the action of Hq, 
and any function f(y) is constant on the orbits of Huy, as expected. 


7.2.2 From Linear Representation to Nonlinear Realization 


The ambiguity in the choice of coordinates that linearize the action of Hy) around 
a chosen point x9 € M can be used to complete the classification of group actions. 
First of all, note that the set {7,.xo|g € G} defines a submanifold of M. On this 
submanifold, G acts transitively; it is thus equivalent to the coset space G/H,,. We 
can choose a set of coordinates 1“, a = 1,...,dimG/Hy,,, on it. Then we add 
another set of coordinates x}, 9 = 1,...,dim M — dim G/H,,, so that (2%, x£) 
together is a well-defined coordinate system on M in the vicinity of xg. In this 
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coordinate system, the coset space G/H,, is embedded in M as the set of points 
(z4, 0). 

As the next step, we subject the coordinates (2, x£?) to the linearization (7.10); 
with a slight abuse of notation, I will use the same symbols (z“, x?) for the resulting 
new coordinates. Importantly, the condition x} = 0 is preserved by the procedure. 
We thus end up with a set of coordinates (xf, x°) in which the isotropy group 
Hx is represented by linear transformations, and moreover the subset (zr, 0), 
parameterizing G/H,, is invariant under the action of G. The latter implies that 
the representation of Hy} on the coordinates (z“, x?) has an invariant subspace, 
that is, it is reducible. I now use once again the assumption that Hy} is compact. 
This ensures that the representation of Hy, is completely reducible. The action of 
Hx, can then be brought to a block-diagonal form, 


Th (xt, x) = (x, X’) = (D™ (hf pn”, D(A)? x7), hE Hy, 
(7.17) 


by orthogonalization that leaves the subspace (x, 0) intact. The latter feature 
ensures that we can still use the coordinates 7“ to parameterize the submanifold 
G/ Hx- 

A given point x € G/H,, with coordinates z^ can be reached from xo by the 
action of any element of G that lies in the coset of x. To proceed, we need to choose 
a concrete representative element of the coset. In line with the notation common 
in theoretical physics, I will denote this coset representative as U (7x). The concrete 
choice of the representative can be made fairly arbitrarily. There are however some 
natural requirements that will make our life easier: 


e U(x) should be a smooth function of x4 near the origin xo. 

e The origin xo itself, i.e. the trivial coset e Hyg, should be represented by U (0) = e. 

e The choice of U(x) should reflect the linearity of the action of H, in the 
coordinates 7“. 


The first two constraints can always be satisfied. As to the third, the linearity of the 
action of Hyo requires that Tax = ThTu(r)xo = Tuya’) xo where zc“ are linear in 
z“. Given that Tax = Ty Tyr) Tp-1x0 = Thu(x)h-! X05 it is natural to pick U (7) so 
that 

U(x) = hU (mh! . (7.18) 


We need to make sure, however, that this can be done consistently. 
Consider the Lie algebra g of G, which carries an adjoint action of Hyg, 


Q > hQh! , Qeg, heH. (7.19) 
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This defines a linear representation of H,, which has an invariant subspace, namely 
the Lie algebra h of Hx. Thus, the representation is reducible. Suppose now that 
it is, in fact, completely reducible. This is certainly the case when Hyo is compact. 
More generally, coset spaces G/H,, for which the Lie algebra g can be split as 
9 ~ D® g/b where both h and g/b are invariant subspaces under the adjoint action of 
Fix, (7.19), are called reductive. For a reductive coset space, we can always choose 
U (x) so that the linear action of Hy, on G/H» is realized by (7.18). We can for 
instance set U (m) = exp(ia“ Qa), where Qa,b,... is a basis of g/b. This is however 
by far not the only choice, as illustrated by Example 7.7. 


Example 7.8 


There are many examples of reductive coset spaces for which the isotropy group H 
is not compact. One can for instance start with a compact H, for which the reductive 
property is guaranteed, and then switch to a related noncompact Lie group. For a 
concrete example, take as G the Poincaré group of isometries of D-dimensional 
Minkowski spacetime and as H its subgroup, the Lorentz group SO(d, 1). The 
subspace g/b can be spanned on the generators of spacetime translations, which 
carry the vector representation of the Lorentz group. This coset space is a cousin of 
the Euclidean space R? ~ ISO(D)/SO(D), for which H ~ SO(D) is compact. 


It is easy to promote the action (7.18) of Hx, on G/H» to an action of the whole 
group G. Indeed, the action of any g € G on an element x = Ty(z)xo of G/ Hy, 
is completely characterized by the product gU (sr). The latter can be, at least in the 
vicinity of the unit element, uniquely decomposed as U(z’)h(z, g), where 2“ is 
defined by T,x = Ty _)xo and the factor h(x, g) € Hy, ensures that the correct 
representative of the coset T,x is used. 

In order to lift the action of G from the submanifold G/H,, to the whole of M, 
we now make one last change of coordinates. Namely, we define new coordinates 
wz and x® by the requirement that the point (r1, x2) € M can be expressed as 


(x, x°) = Tua) (0, x) - (7.20) 


This notation can be intuitively thought of as defining a slicing of M by orbits 
of G, starting from the subset of points (0, X£). The submanifold G/H,, is a 
special case corresponding to x° = 0. In order to see that 7%, x® in fact are 
well-defined coordinates on M, note that (7.20) defines uniquely 2“, x} for given 
a, x®. It follows from the fact that U (0) = e that if 7” = 0, then 7“ = O and 
X£ = X£. Likewise, it follows from the definition of coordinates on the submanifold 
G/H,, that if x° = 0, then xe = 0 and z^ = 7%. Hence the Jacobian matrix 
a(z, x)/0(Z, X) equals 1 at the origin, and (7.20) defines a valid coordinate system 
in some neighborhood of xo. 
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The linearity of the action of Hy is preserved in the new coordinates since 


Tr Tu (0, X) = Thyn- THO, Z) = Thuan- (0, D (h) X) (7.21) 


for any h € H,,. It is now a matter of a short manipulation to see that the action of 
an arbitrary g € G on (7.20) is already completely fixed, 


ToTu 0, X) = Touca O, X) = Tuana, g) 0, X) = Tue) Thag, X) 


= Tyan (0, DO (h, g)) xX) - 
(1.22) 


This is the end of the line. Equation (7.22) shows that in some neighborhood of a 
chosen point x9 € M, the action of any Lie group G can by a change of coordinates 
be brought to a “standard form” such that (dropping the tildes) 


U(r) > U(r’) = gU(a)h(a, g) |, xe > x? = DM (h(a, 8))8 x? , 
(7.23) 


where A(z, g) € Hy, and DO) is a matrix representation of H. xo: The only technical 
assumption that was required in the proof was that the isotropy group Hy, is 
compact. This is the main result of the chapter. We can now harvest the fruits of 
our labors. 
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Having in mind that the reader might have skipped the last, somewhat technical 
section, let me give here a brief but self-contained summary of its main result. 
Consider a manifold M equipped with an action of a group G, and let us choose 
a fixed point x9 € M. Suppose that the isotropy group H,, of xo is compact. Then 
it is always possible to redefine coordinates in a neighborhood of x9 so that the new, 
“standard” coordinates (x, x£) have the following properties. First, the point xo 
itself corresponds to the origin (0, 0). The subset (7%, 0) spans a submanifold of M, 
equivalent to the coset space G/H. Every point on the coset space can be uniquely 
characterized by a choice of a representative element U (sr) of the corresponding 
coset. The representative U (zr) can be chosen so that U (0) = e, and that the adjoint 
action of h € Hy, U(r) > hU (t)h—!, defines a linear transformation of the 
coordinates 7“. The group G acts on the coset space via left multiplication, which 
defines implicitly an element A(x, g) of Hy through 


gU (n) = U (n(n, g))h(a, g) , geG. (7.24) 


The last two properties of U (x) imply that h(z, g) = g for any g € Ay. 
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The action of an element g € G on the whole manifold M is now defined in 
terms of the standard coordinates (1%, x£) as 


U(r) > U(a' (x, g)) = gU(a)h(r, g8)! , 
(7.25) 


xe > x(x, 2, 8) = DAC, 8))% x" , 


where the matrices D(h) define some linear representation of Hx}. Altogether, the 
action of G is fully specified by the choice of coset representative U (zr), which fixes 
the first line of (7.25), and the choice of representation D(A) of Hx, which fixes the 
second line thereof. 


The above construction of the standard realization of group action goes 
through without change even if H,, is noncompact provided the coset space 
G/H,,. is reductive. In that case, however, the line of argument in Sect. 7.2 
fails and it is no longer guaranteed that the standard realization is unique up 
to a coordinate redefinition. There may then be more mutually inequivalent 
nonlinear realizations of the group G on the manifold M, of which the 
standard realization is but one example. 

Finally, one may try to follow the same steps of the construction of the 
standard realization even when the coset space G/H,, is not reductive. Then, 
however, many of the simple features of the standard realization are lost. A 
concrete example of a nonreductive coset space is worked out in [11]. 


In the standard nonlinear realization (7.25), the coordinates x° transform under 
G on their own, independently of x}. The transformation of the latter, on the other 
hand, is nonlinear in 2“ but linear in x? themselves. It is therefore possible to set 
the x°s to zero consistently. This is not surprising. In the field theory language, 
the coordinates 2“ correspond to NG bosons and their universal presence therefore 
mirrors the Goldstone theorem as reviewed in Chap. 6. The remaining coordinates 
on M, x8, represent other degrees of freedom that are not of NG nature. In the 
jargon of EFT, they are usually called matter fields. Among all the nonlinear 
realizations of the given symmetry group G with the given subgroup H, there is 
therefore a “minimal” nonlinear realization, defined on the coset space G/H, which 
only includes the NG degrees of freedom. On a general manifold M, additional 
matter fields may be present. Throughout this book, I will focus mostly on minimal 
nonlinear realizations due to their significance for low-energy EFT description of 
broken symmetries. 
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Example 7.9 


One can gain insight into the standard realization (7.25) of the action of G by 
looking at some special choices of the isotropy group. If there is a point x» € M 
such that Hy) ~ G, then in its vicinity, we do not have any coordinates 7“. The coset 
space G/H,, consists of a single point that we can represent with U = e, which is 
consistent with the first line of (7.25) if h(g) = g for all g € G. All coordinates in 
the neighborhood of xo are of the x? type. The second line of (7.25) guarantees that 
the action of G can be completely linearized, x’? (x, g) = D(g)°« x°. This is the 
usual textbook realization of symmetry via a linear representation. 
The opposite extreme, H,, ~ {e}, is more interesting. Here (7.25) reduces to 


U(r) Š gU(x), xo & xe. (7.26) 


The coset space G/Hxo corresponds to the group manifold G and carries an action 
of G defined by simple left multiplication. Whatever other coordinates x° on M 
are present can always be chosen to be invariant under G. This is quite surprising. 
We are used to working with fields spanning linear multiplets of G; it is not obvious 
that the same physical content can be encoded in a set of fields that do not transform 
under G at all. The resolution of this apparent paradox lies in the freedom to choose 
coordinates at will. Namely, if we start with a set of fields Y? transforming under G 
as We 4 D(g)°,W%, we can make the redefinition x2 = D(U()~!)°, W9. The 
new variables x? are invariant under G in accord with (7.26). 


7.3.1 Nonlinear Realization on Coset Spaces 


Can we be more explicit about the way that the coordinates (x°, x°) transform 
under the action of G? The standard realization (7.25) of the group action requires 
the knowledge of the nonlinear functions x° (x, g) and h(x, g) . We would like 
to be able to compute these, at least for small transformations, that is for g € G 
infinitesimally close to unity.* To that end, it is sufficient to consider the action of G 
on the coset space G/H; from now on will I drop the subscript x9 on H unless it is 
needed to explicitly distinguish the isotropy groups of different points on the coset 
space. Once the minimal realization of G on G/H is found, it can be extended to 
any other manifold by specifying the linear representation D(h) of H under which 
the additional coordinates x? transform. 


4I will only introduce the concept of a metric on a group, and more generally on a homogeneous 
space, in Sect. 7.4. Statements about infinitesimal distance of group elements should therefore be 
interpreted within a faithful matrix representation of the group using some standard matrix norm. 
The same remark applies whenever a sum or difference of group elements is considered below. 
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The general algorithm for calculation of the desired functions 2’“(z, g) and 
h(x, g) is as follows. Take the first line of (7.25), multiply it with U ()—!, and 
subtract U (x)! U (x) = e. This gives the master relation 


U(x) IBU (rt) = U(r)! gU (mhr, g)! — e, (7.27) 


where 8U (x) = U(x’) — U (x). For g € G that is infinitesimally close to unity, 
both U(s)~! gU(z) and h(x, g) are infinitesimally close to unity as well. By a 
systematic comparison of the left- and right-hand sides of (7.27), one can then 
determine 82“ (x, g) = x" (7, g) — x“ as well as h(z, g). 

To make further progress, we first have to establish some notation. In order to be 
able to discuss different symmetry transformations from G separately, we choose 
a basis Q4 8... of g. A subset of these spans a basis of the Lie subalgebra h and 
will be denoted with Greek indices, Q«,g,.... The rest of the generators spans the 
subspace g/ and will be denoted as Q,.,,... The structure constants of the Lie 
algebra g will be called ee with a conventional factor of i in the commutation 
relations, that is, [Q4, Qg] =i oe Qc. The structure constant does not have to be 
fully antisymmetric in its three indices even if G is compact. It does have to be 
antisymmetric under the exchange of its lower two indices though. It also has to 
satisfy the Jacobi identity, 


fiafic + fief Ea + feat es =. (7.28) 


The basic commutator of the Lie algebra g, [Q4, Qg] =i ioe Qc, can be unfolded 
into three separate conditions on the subsets of generators Qg,g,... and Qa,p,..., 


LQw, Qg] = ifs Qy , 
lOa, Qn] = ify, Qc > (7.29) 
[Qa, Qo] = i( 7 Qy + fin Qe) : 


The first of these encodes the requirement that the generators Q«,g,... span a closed 
Lie algebra (that is Sup = 0). The second of these likewise expresses the assumption 
that the coset space G/H is reductive (that is i p = 0). 

Next, we are going to need two simple statements from linear algebra, which I 
reproduce here for the sake of completeness. The first of these is usually known 
under the name Hadamard lemma, 


e4Be4 = B+[A, B] + TIA, [A, B]] + —[A,[A,[A, B]]] +, (7.30) 
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where A, B are arbitrary (square) matrices. The second identity, which to the best 
of my knowledge does not have an established name, reads 


1 
e 4Ade4 = f dt e™4(dA)e4 . (7.31) 
0 


Here the symbol d acting on A and e4 can be thought of as a differential, but also as 
a derivative with respect to whatever parameter A might depend on. Once multiplied 
from the left with e4, we can think of (7.31) as a continuous version of the Leibniz 
(product) rule, applied to e^, It follows from (7.31) combined with the Hadamard 
lemma (7.30) that whenever A is a function with values in a Lie algebra, e Ade4 
will take values in the same Lie algebra. 

Equation (7.31) prepares the ground for the introduction of a concept of central 
importance for calculus on coset spaces: the Maurer—Cartan (MC) form, 


olr) = -iU (1) !dU (1) . (7.32) 


For the time being, the reader may think of the d herein as an ordinary differential of 
a function. The true geometric significance of the MC form as a differential 1-form 
will become clear in Sect. 7.4. Recall that any element of a Lie group sufficiently 
close to unity, in our case U (zr), can be obtained as the exponential of an element of 
the corresponding Lie algebra. Equation (7.31) then implies that the MC form takes 
values in the Lie algebra g. We can split it into pieces that belong to h and g/b, and 
represent each of these in terms of their components in a chosen basis of generators, 


w=] +o, 
4 (7.33) 
w= QA, a = 0 Qa, o, =o Qa. 


Finally, by writing dU (m) = [ƏU(x)/ðzx ]dzr!, one can introduce explicit 
components of the MC form in a chosen set of local coordinates m° on G/H, 
A—,,A drí 
wt = w, dr’. 
It is instructive to check how the MC form is affected by the action of G on the 
coset space. This follows directly from (7.25). It is a simple exercise to verify that 
for given g € G, 


o) Š oa, 8)) = A, goh, g)! — ihr, gdh, 8) , 


wL) > o, 8)) = hr, go her, g) . 
(7.34) 


Note how (7.31) guarantees that œ] (zc) still takes values in the Lie algebra h. While 
not of direct relevance right here and now, the transformation rules (7.34) will help 
us understand the geometric meaning of the MC form in Sect. 7.4. 
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We still need a few last pieces of notation. Conjugation of elements of g by U(r) 
will be abbreviated as 


U(r) QAU) = v2) Oz , (7.35) 


which defines a set of nonlinear functions v (x) on the coset space. Finally, for the 
action of an element g € G infinitesimally close to unity, I will use the notation 


gretice4Q,, 


(7.36) 
Srl (m, g) = e ES), hr, g) Ne +ie K4(T) Qa. 
The © symbol indicates that I have expanded all the quantities to linear order in the 
transformation parameters €^, defined by the first line of (7.36). The second line 
thereof introduces notation for the infinitesimal versions of the functions z’“(z, g) 
and A(z, g). Of particular interest are the functions E (x) that realize the motion 
induced on the coset space G/H by the group G. 
With all the notation at hand, we can now expand (7.27) to linear order in c^ and 
compare coefficients of the various generators of g on the left- and right-hand sides. 
This leads to the identities 


va (1) = E4 (wag O) + kA COT) , 


(7.37) 
va) = EA) of (a) . 

These are still valid for any choice of the coset representative U (7). Once it is 
fixed, the functions oA (zr) are determined by (7.32). Likewise, the functions v 2 (zr) 
are fixed by (7.35). The identities (7.37) then constitute a set of linear equations 
for &4 (ar) and k4 (zc). At the origin, v (0) = A as a consequence of the fact that 
U (0) = e. The second line of (7.37) then implies that œw% (0) is nonsingular. By 
continuity, it must remain nonsingular in some neighborhood of the origin. This 
guarantees that a solution of (7.37) for &4 (x) and k4 (zr) exists and it is unique. 

I have now achieved the main goal of this subsection: to give an algorithm how, 
for a chosen set of coordinates 21“, to realize the action of the group G on the 
coset space G/H. None of the nonlinear functions involved—ot (m), v A (m), €4 (7) 
and k% (z)—can however in general be evaluated in a closed form. For practical 
applications, it is useful to have explicit expressions for these functions, even if just 
as a series expansion in a specific set of coordinates 7“. One popular choice of 
parameterization for which this can easily be done is 


U(x) = exp(i" Qa). (7.38) 


The Hadamard lemma (7.30) then tells us at once that 


1 
vB) = 8B — fhan" + 3 feahenn'n’ + O(n’). (7.39) 
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Likewise, it follows quickly from (7.31) that 


1 1 
w(x) = 64 — : Aa? + Talee ntr? +0?) . (7.40) 


Finally, (7.37) can be solved iteratively for the remaining pieces, 
a a a 1 e ça b 
£401) = 84 — San — 594A feb m 
1 1 1 
+5 (Sfi FASES SAt) r +00, 
ka) = 64 — | faa - zasa) 


1 1 
F 7 ( E i GTE + 585 fifi) nn? + O(n) . 
(7.41) 


With these explicit expressions at hand, it is easy to illustrate some of the general 
properties of the standard nonlinear realization. For instance, the action of the 
isotropy group H reduces to the linear adjoint transformation of the coordinates, 
Esr == fen? . Likewise, we find that kg (7) = ôg» which is an infinitesimal 
version of the relation A(x, g) = g for any g € H. 


All the structure introduced above takes a particularly simple form when the group 
G is Abelian. Then ¿4 (7r) = 54: the group G acts on the standard coordinates 2“ 
by a mere set of shifts. The MC form reduces to w(x) = da“ Qa. This suggests a 
simple interpretation of w; = = wfdn? Qa in the general situation when G is non- 
Abelian. Namely, œ? supplies us with a generalized derivative (or differential) of 
zx? which, as (7.34) shows, is covariant under the action of G. 


7.3.2 Symmetric Coset Spaces 


Up to some technical assumptions, we have already accomplished a complete 
classification of possible nonlinear realizations of symmetry on manifolds. In the 
language of field theory, this amounts to finding all possible point symmetry 
transformations of a given set of fields under a given symmetry group G. The 
price we had to pay for the generality of this result was the rather complicated 
transformation rule (7.25). This involves nonlinear functions on the coset space 
G/H that cannot be evaluated in a closed explicit form. There is, however, an 
important class of coset spaces for which we can do much better. 
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The coset space G/H is called symmetric, and the associated homogeneous space 
is called a symmetric space, if the Lie algebra g admits an involutive automorphism 
under which b is even and g/b is odd. In other words, we require that there is a linear 
map R : g — g such that 


RQ, Q2) = [R(Q1),R(Q2)], Q1, Q2 EG, 


RQ) =, Qeb, (7.42) 
This property guarantees the vanishing of any structure constant with an odd 
number of g/b indices. In particular, i = 0: any symmetric coset space 
is automatically reductive. In addition, Jio = 0, that is the last term in the 


commutation relations (7.29) is missing. 


The fundamental commutation relation of the SO(n) group reads 
[Jij, Sea] = i(ôik J jt + êjiJik — Sit jk — Sjx Jit) . (7.43) 


where i, j, k,l =1,...,n and Jj; is the antisymmetric tensor of angular momentum 
in n spatial dimensions. The Lie algebra of SO(n) possesses an automorphism R 
under which R(Jog) = Jog and R( Jon) = — Jan, where a, B = 1,...,n — 1. One 
can think of this automorphism geometrically as an inversion of the n-th coordinate 
axis. The coset space SO(n)/SO(n — 1) > S"—! is therefore symmetric. 

In a similar vein, the Euclidean space R” is a symmetric space. This can be seen 
most easily by recalling that R” ~ ISO(n)/SO(n). The desired automorphism R 
is the spatial inversion. Under this, all the generators of SO(n) (rotations) remain 
intact, whereas the remaining generators of ISO(n)/SO(”) (translations) change 
sign. 


Consider the “chiral” coset spaces of the type GL x Gr/Gy, where all the three 
groups GL, Gr, Gy are isomorphic to the same Lie group G; this is a generalization 
of Example 7.7 where G ~ SU(2). The chiral group GL x Gp consists of elements 
(gL, &R) where gL, gr € G. The “vector” isotropy group Gy consists of elements 
of the type (g, g), that is gg = gr = g. The generators of the chiral group include 
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two copies, Qi, 4 and Qp.a, of the generators of G. The Lie algebra of the chiral 
group is defined in terms of the structure constants i of G by 


[O..a, Ov.el=ifGOrc, (Ora, Orsl=iff,Orc. 
[OL.a, Or.pl =0. 


(7.44) 


These commutation relations are invariant under the exchange of QL,4 and QR,A, 
which defines the desired automorphism, 


R(OL A) = OR A; R(OR,A) = QLA - (7.45) 


The generators of Gy, equal to QL, 4 + Qr,a up to overall normalization, are even 
under this automorphism. The generators of the complementary space g/b, which 
are to be odd under &, can be taken as QL, 4 — QR, up to an overall factor. 


The automorphism & of the Lie algebra g can be lifted, at least locally near the 
unit element, to the Lie group G. It is then possible to choose the coset representative 
U(x) so that R(U(x)) = U(s)~!; one can use for instance the exponential 
parameterization (7.38). We now take the first line of (7.25) and multiply it with the 
inverse of its image under R. The result is a surprise: for symmetric coset spaces, 
there is a parameterization of G/H in which the whole group G is realized linearly, 


L(x) = U(r) , E) È E, g)) = 8ER! . (7.46) 


This is of such utility that whenever one deals with a symmetric coset space, one 
almost always uses the linearly transforming variable X instead of working directly 
with the coordinates 27“. Note, however, that one may need the coordinates z® if 
one wishes to extend the coset space G/H to a larger manifold M. This is because 
the transformation of x® in (7.25) requires the functions h(z, g) that depend on x“. 

With the automorphism at hand, one may easily project out the @j and w, 
components of the MC form, 


1 1 
| = ze +R(o)] , w = ze —R(o)] . (7.47) 
The latter has a practically convenient expression in terms of £, 


o, = -5U-'dEU-! = sUaE tu . (7.48) 
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Let us see how the variable È (zr) is realized on the chiral coset spaces GL x GR/ Gv 
discussed in Example 7.12. Here we can choose the coset representative as U = 
(u, u—!) where u € G; this satisfies the requirement that under the automorphism 
R(gL; ZR) = (gr, gi) of the chiral group GL x Gr, U is turned into its inverse. The 
general transformation rule as given by the first line of (7.25) then translates to 


(u, u7!) EE, (gr, gaya, wD AT) , (7.49) 


where h € G. The linearly transforming variable (7.46) is given by © = U? = 
(u?, i. where u? = U transforms under GL x Gr as U > eLUgp. 


It turns out that (7.46) can be further simplified in case R is an inner automor- 
phism of G, that is, when there is an element R € G such that 


R(g) = R-'eR, geG. (7.50) 


Then a slight modification of (7.46) leads to a variable that transforms linearly under 
the adjoint action of G, 


N(x) = X(t)R = U(x) RU (tt)! , Na) & NC, 8) = Na)! . 
(7.51) 


An advantage of trading © (x) for N (7) is that N (x)? is a constant independent of 
mz“, which may be convenient in concrete applications. Indeed, since R(R(g)) = g 
for any g €e G, it follows from (7.50) that R2 belongs to the center of G. 
Consequently, N(x)? = U(r) R2U (x)! = RÈ. 


Consider the coset space SU(2)/U(1), relevant for description of quantum systems 
with magnetic ordering such as (anti)ferromagnets. In the fundamental represen- 
tation, the generators of SU(2) are Q4 = t,4/2, and the generator of the U(1) 
isotropy group can be taken as 13/2. This coset space is symmetric thanks to an 
inner automorphism (7.50) with R = it3. While the factor of i here is required to 
make R an element of SU(2), we do not need it for the definition of N(zr). Let us 
therefore set 


N) = U(x) U(r)! . (7.52) 
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This matrix variable is unitary and Hermitian. Moreover, it is traceless and it squares 
to 1. It can thus be mapped on a unit vector variable n(zr) such that 


N(w) =T-n(z), (7.53) 


which belongs to the vector representation of SU(2). The fact that we ended up 
describing the coset space in terms of a unit vector is not a coincidence. Thanks to 
the local isomorphism of SU(2) and SO(3) we also have a local equivalence of coset 
spaces, SU(2)/U(1) ~ SO(3)/SO(2) ~ S?. 


7.4 Geometry of the Coset Space 


In this final section of the chapter, I will show that some of the structure that enters 
the standard realization of symmetry on a coset space can be given a neat geometric 
interpretation. A reader not familiar with basics of differential geometry is advised 
to consult Appendix A before proceeding. Further information about the geometry 
of homogeneous spaces at an easily accessible level can be found in [12]. 

To start, recall that each point of a coset space G/H corresponds to an entire class 
of elements of the group G. The approach I have used so far was to parameterize 
each x € G/H with coordinates m° in terms of a fixed coset representative 
U(x). The choice of U(x) is however arbitrary and can be changed locally by 
multiplication from the right; any U (x) = U(a)hA(z) with h(x) € Ay, is equally 
good. One can then promote the basic transformation rule (7.25) for U (zr) to 


Un) 22. U, 2) = gU! , (7.54) 


where A(x) is independent of g. This realizes an action of the product group 
G x Hgauge. The group G acts on U(x) by left multiplication as usual. The 
local group Hgauge isomorphic to H acts on U(x) by right multiplication with 
h()~! and encodes the freedom to choose locally the coset representative. This 
is a typical example of a gauge redundancy; any geometrically or physically well- 
defined quantity must be independent of the arbitrary choice of coset representative. 
In theoretical physics, the approach that views the left action of G separately from 
the right action of the local group Hgauge is called “hidden local symmetry” [13]. 

The different geometric roles of G and its isotropy subgroup can be intuitively 
understood by looking at Fig. 7.2. One-parameter subgroups of G define a set of 
flows on G/H that in general translate a given point x to some other point. The 
isotropy group Hy maps x to itself. It does, however, act nontrivially in the vicinity 
of x. Based on the figure, we expect that the action of H, projects to a set of 
linear maps on the tangent space 7,G/H at x. By induction, such linear Hy- 
transformations should exist for any tensor at x. 
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Fig. 7.2 Illustration of the action of G and Hy around a fixed point x (black dot) on the coset 
space. The thick solid lines indicate orbits of one-parameter subgroups of G passing through x; 
these correspond to generators from g/ and act as translations on the coset space. The thick dashed 
lines indicate some orbits of H, in the vicinity of x. The action of H, naturally induces a set of 
linear transformations on the tangent space at x 


The local group Hgauge gives us the freedom to do independent H- 
transformations of tangent vectors (and generally tensors) point by point. In order 
to be able to analyze the properties of vector (tensor) fields on G/H, we therefore 
need a way to relate tangent vectors (tensors) at different points. This resembles 
closely the motivation behind the construction of a local frame, or vielbein, and 
the corresponding connection on a manifold. The only difference to the formalism 
reviewed in Appendix A.5 is that here the structure group is not GL(dim G/H) but 
H itself. This restriction makes it possible to realize changes in the local frame in 
terms of a fixed matrix representation of H. 

It turns out that we already have both the local frame and the connection: they are 
granted to us by the MC form (7.32). Specifically, the components w° of w; define 
a coframe on G/H. By the second line of (7.34), these indeed transform linearly 
under the local action of Hgauge as they should. Likewise, the part of the MC form, 
wj, provides the necessary H-connection. This is confirmed by the transformation 
rule on the first line of (7.34). 

This observation stresses the significance of the MC form as a differential 1-form, 
taking values in the Lie algebra g. Its exterior derivative reflects the structure of G, 


dw = —idU~! A dU =iU~'!dU A U~'dU = iow Aw 
i g c l A B C paa 
= 3% NO [QB Qc] = 5 Sac Aw Oa. 


7.4 Geometry of the Coset Space 135 


This can be split into a pair of equations, one for @ and one for w_, 


1 1 
do” = 5 Spo Aw + aTa Aw, 
(7.56) 


1 
dof = Fhe” Ao + 5 Since" Aw , 


where I used the facts that Spy = 0 (the Lie algebra h closes) and that fse = 0 (the 
coset space is reductive). These so-called Maurer—Cartan equations provide a link 
between the algebraic and geometric properties of the coset space. 


7.4.1 Canonical and Torsion-Free Connection 


The local basis of 1-forms œf transforms under the action of the structure group H 
as a tangent vector. It follows from (7.34) that upon an infinitesimal transformation 
by h X e+ ic” Qu, dœ = —e®% ffa. This determines the matrix elements of the 
action of the H-connection œw on tangent vectors, in the notation of Appendix A.5, 


24, = — ho. (1.57) 


The MC equations (7.56) encode information about both the torsion and the 
curvature of this so-called canonical connection on G/H. The torsion 2-form 
follows from the second line of (7.56), 


1 
T! = dof +Q, Aa = JEP Aa. (7.58) 
Similarly, the curvature 2-form follows from the first line of (7.56), 
1 
R°, = dQ", + AX, = 5 far feao A of, (7.59) 


where I used the Jacobi identity (7.28) to simplify the result. In the language of field 
theory, this is nothing but the field-strength 2-form of œw. 

The definition (7.57) of the canonical connection arises naturally from the 
splitting (7.33) of the MC form into the h and g/b subspaces. This is not the only 
possible connection on G/H though. In fact, there is an infinite class of them. Let 
us set 


A047, = — fio" Afio, AER. (7.60) 
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This time, it takes some manipulation to derive the corresponding torsion and 
curvature 2-forms. The final result is 


1 
ae = G — i) fio" ro , 
(7.61) 


1 
‘RY, = —5 (fab fea + Mc fea — 2 foo Sha)’ 0 08 , 
which generalizes (7.58) and (7.59) to any nonzero À. 

Within the class of connections (7.60), there is one with vanishing torsion, 
corresponding to A = 1/2. One might expect that it should be possible to recover 
this connection from a suitable Riemannian metric on the coset space. This is indeed 
the case under some further technical assumptions, as I will explain below. For 
symmetric coset spaces, the whole class of connections (7.60) becomes degenerate, 
and is automatically torsion-free. 


7.4.2 Riemannian Metric 


Every Lie algebra g possesses a bilinear form invariant under the adjoint action 
of the corresponding Lie group G. To see this, just take any faithful matrix 
representation of the generators Q 4 and set 


gap =tr(QaQp). (7.62) 


The invariance under Q4 —> gQag™! for any g € G is manifest. The infinitesimal 
version of the invariance condition follows by setting g ~ e+-ie© Qc and expanding 


to linear order in eC, 


fagns + fësgan =0. (7.63) 


Since the components w° of the MC form define a basis of the space of 1-forms 
on G/H, any rank-2 covariant tensor can be constructed as a linear combination of 
ow ® œw’. Given the invariant bilinear form gap, it is then natural to introduce the 
following metric on G/H, 


8G/H = Sabo" D ow , (7.64) 


where gap is the restriction of g4pg to the g/h subspace. 

This construction is not guaranteed to work without further assumptions. A 
Riemannian metric should be positive-definite. A sufficient, though not necessary, 
condition for this is that gag itself is positive-definite, which is generally only true 
for compact semisimple Lie algebras. For what follows, I will only need the weaker 
assumption that g4g is nondegenerate and that gag = O, i.e. that the subspace g/b 
can be chosen to be “orthogonal” to . Interestingly, this alone already ensures that 
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G/H is a reductive coset space. Namely, a short manipulation using (7.63) gives 
0= fåg8sc = Sap8Dde =—fPepp = —fe.gps , (7.65) 


which implies that fè = 0 thanks to the fact that gag is nondegenerate. With the 
assumption that g,g = 0, the invariance condition (7.63) also splits into two separate 
conditions on gap, 


fay 8db + forBad =0, fi gan + figad =0. (7.66) 


The metric (7.64) is invariant under the left action of G, since the MC form 
itself is. It is however also invariant under the right action of Hgauge as defined 
by (7.34). This follows from the first condition in (7.66). The G-invariance of 
the metric guarantees the existence of a set of Killing vector fields that realize 
infinitesimal group motions on the coset space. In the local coordinates 2“, these 
are nothing but the functions £4 (r) introduced in (7.36). According to (7.37), we 
have & : (0)a; (0) = 64 at the origin. This means that the Killing vectors &,(0) 
corresponding to generators from g/b define a local frame dual to w“(0), which 
further illuminates the geometric nature of the MC form. Away from the origin, 
the duality between w; and the subset of Killing vectors realizing infinitesimal 
translations on G/H is still expressed by the second line of (7.37). One just has 
to recall that the isotropy group of x = Ty (z)xo is Hy = U(r) Ay,U (t)~!. This 
conjugation is supplied by the matrix v (zr) on the left-hand side of (7.37). The 
local basis of 1-forms w“(zr) is then dual to the local frame consisting of Killing 
vectors of the generators U(z1) Q,U mL. 


The metric (7.64) is particularly easy to evaluate explicitly on symmetric coset 
spaces. Using the expression (7.48) for œw; in terms of the linearly transforming 
variable £, we find at once that 


1 1 
gajn = 7 YOE OAL!) = 7 (0a EET dr" @ dr”, (1.67) 


where I used the abbreviation 0, = 0/dz“. As an illustration, consider the coset 
space SU(2)/U(1) ~ SO(3)/SO(2) > S? discussed in Example 7.14. Here we find 
that up to an overall factor, 


gs2 x dn @ dn = dd Q dO + sin? Ody Q dọ , (7.68) 
where I used standard spherical coordinates to parameterize the unit vector n. This 


is just an elaborate way to show that the unique SO(3)-invariant Riemannian metric 
on SŽ is given by projecting (pulling back) the Euclidean metric on R? to the sphere. 
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It remains to clarify the relationship between the metric (7.64) and the connec- 
tions (7.60). The covariant derivative of the metric with respect to these connections, 
in the direction of an arbitrary vector field v, is easily calculated in the local frame, 


(VogG/H)ab = 1gab] — cb RaW) — Bac*Q%, (0) . (7.69) 


The first term vanishes since gap is merely a set of constants. The sum of the 
second and the third term vanishes for any À as a consequence of the combination 
of (7.60) and (7.66). Thus, the whole class of connections (7.60) is compatible with 
the metric (7.64). In case the metric is (pseudo-)Riemannian, it is known that there 
is a unique metric connection without torsion, called the Levi-Civita connection. As 
shown in the previous subsection, this corresponds to the choice à = 1/2. 


The metric on the coset space is not uniquely fixed by the requirement of 
invariance under G x Hgauge. Although it was natural to start the construction 
with the Cartan—Killing form (7.62) on the whole Lie algebra g and then 
restrict it to the g/h subspace, we could have as well started from the latter. 
One then finds that possible metrics on G/H invariant under G x Heauge are 
classified by constant symmetric tensors gap invariant under the adjoint action 
of Agauge, that is gap Satisfying the first relation in (7.66). For gap that does 
not satisfy the second relation in (7.66), the connections (7.60) with à Æ 0 are 
not metric-compatible. In particular !/7Q¢ p İs not the Levi-Civita connection 
despite being torsion-free. 
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A 


Check for 
updates 


We are now finally in the position to construct a low-energy effective field theory 
(EFT) for systems with a spontaneously broken internal symmetry. We do not need 
any specific model to describe the dynamics of spontaneous symmetry breaking 
(SSB). All we need is information about the internal symmetry group G and its 
unbroken subgroup H. These determine the basic degrees of freedom of the EFT: 
the Nambu—Goldstone (NG) fields. With the known symmetry and field content, one 
must then include in the effective Lagrangian all operators allowed by the symmetry 
(Chap. 12 of [1]). This ensures that the predictions of the EFT match, to the desired 
accuracy, those of any microscopic theory with the same pattern of SSB. 

The NG fields can be thought of as (local) coordinates on the coset space G/H. 
In the previous chapter, I showed that the action of the symmetry on these fields can 
be assumed to take the “standard form,” cf. Sect. 7.3. The task to construct the most 
general action invariant under such nonlinear realization of symmetry is nontrivial. 
In Sects. 8.1 and 8.2, I will offer two different approaches to this problem. The first 
of these is more general but requires substantial background in differential geometry. 
The second approach is based on certain simplifying assumptions but has the benefit 
of only requiring elementary field theory. In the concluding Sect. 8.3, I use the EFT 
to derive the equation of motion (EoM) for the NG fields. With the help of the latter, 
I then reanalyze the spectrum of NG bosons. 


8.1 Structure of the Effective Lagrangian 


To keep the discussion simple, I will assume unbroken symmetry under continuous 
spacetime translations and continuous spatial rotations. This is just a matter of 
practical convenience; the same methodology can be applied, for instance, to 
systems whose spatial rotation symmetry spans a (possibly finite) subgroup of 
SO(d). I will also assume that the spatial dimension is d > 2. On the practical side, 
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this is needed for any continuous rotations to exist. On a more fundamental note, 
the possibility of SSB in one spatial dimension is severely restricted; see Sect. 15.2. 

The action of the low-energy EFT must inherit the symmetries of the underlying 
microscopic theory. As a rule, the symmetry admits an infinite number of operators 
in the effective Lagrangian, which require an infinite number of a priori unknown 
coupling constants. In order that the EFT has any predictive power, we need an 
organizing principle to tell us which of the allowed operators are “relevant.” Recall 
that the EFT is expected to be valid at low energies such that the only active degrees 
of freedom are the NG bosons. The individual operators in the Lagrangian can 
then be sorted by the number of derivatives they contain. The more derivatives, 
the smaller effects the operator is expected to produce. Such ordering of operators 
in the effective Lagrangian is called derivative expansion. 

Since spatial and temporal derivatives are not related by any of the assumed 
symmetries, we must count them separately. The low-energy effective Lagrangian 
can then be organized as a double series, 


Ban] ye i (8.1) 


s,t>0 


where s,t¢ denote respectively the numbers of spatial and temporal derivatives. 
Following loosely [2], I will focus in this section on contributions with at most 
two derivatives, s +t < 2. These are needed to pin down the kinetic term for all 
the NG fields, and thus carry information about the spectrum of NG bosons. At the 
same time, they encode the dominant interactions of NG bosons at low energies. 
Operators with a higher number of derivatives are classified more easily using the 
approach developed in Sect. 8.2. 

Within the diagrammatic expansion of quantum field theory, a given observable 
may receive contributions from graphs with different numbers of loops. The 
relative importance of various contributions depends on the operators that enter 
the interaction vertices and the number of loops. I will work out a precise power- 
counting scheme in Chap. 9, where concrete examples of EFTs are discussed at 
length. For the time being, I will content myself with the rule of thumb that loops 
lead to parametric suppression just like derivatives in the interaction vertices. This 
justifies the approach adopted throughout the whole book, whereby NG bosons are 
treated in the classical (tree-level) approximation. 

The analysis of the structure of the effective Lagrangian (8.1) is considerably 
simplified using the language of differential geometry. Following the presentation 
below will therefore require some familiarity with the contents of Appendix A. A 
reader uninterested in the details will find a summary of the leading part of the 
effective Lagrangian (that is operators with s +t < 2) in Sect. 8.1.4. 
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8.1.1 Reminder of the Standard Nonlinear Realization 


I will start with a brief reminder of the standard nonlinear realization of symmetry. 
The intention is to have the basic elements of the formalism at one place for easy 
reference. Further details can, if needed, be found in Chap. 7. 

Every point on the coset space G/H is labeled with a representative U(r) € G 
of the corresponding coset, where x“ are local coordinates on G/H (NG fields). The 
action of the group G on the coset space can then be defined via left multiplication, 


U(r) > U, 8) = 8U mha, g)! , (8.2) 


where A(x, g) € H. The choice of the representative U (7) is not unique, but can 
be made so that U (0) = e and that any g € H acts on U(x) by conjugation, 
U(x) EA gU()g~!. In other words, h(x, g) = g for any g € H independently of 
m’. 

The generators of G are denoted as Q 4,g,.... A subset of these, Q«,g,..., spans the 
Lie algebra h of the unbroken subgroup H. The remaining, broken generators of G 
are denoted by Qa, ba The structure of the Lie algebra g is fixed by the structure 
constants or through [Q,4, Og] = fey Qc. Under the action of a group element 
infinitesimally close to unity, g © e + ie4 Q 4, (8.2) reduces to 


Sr“ (m0, g) © €AE4 (T) , A(x, g) © e+ ieAk4 (1) Qa - (8.3) 


The functions &% (sr) define a set of vector fields on G/H, a(t) = €4 (71) 0/0“, 
whose Lie bracket reproduces the structure of 9, 


la, £B] = fipc - (8.4) 
An object of fundamental importance is the g-valued Maurer—Cartan (MC) form, 
w(1) = —iU (m) tdU (r) = w4 (1) Oa . (8.5) 

This can be split into a part belonging to the subspaces þh and g/b, 
O=o)+o1, wj = oQ , w, =o"Q,. (8.6) 


The components w^ of the MC form and the vector fields £ 4 are mutually dual in a 
well-defined sense. Their precise relationship is expressed by the identities 


lg, @” = vi = k$ E tg @" = v4 7 (8.7) 


! The basis Qa,p,... is to be chosen so that it spans a subspace g/b of the Lie algebra g of G carrying 
a representation of H, hence g ~ b @ g/b. 
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where the matrix function v} B (ar) is defined by the conjugation 


U(r) QAU) = v8) Oz . (8.8) 


Under G, the unbroken and broken parts of the MC form transform as 


wy (I) Š oa, 8)) = h(t, goh, g)! — ihr, gdh, 8)! , 


oim) > oLa, g)) = hlr, gjo mhOr, gy! . 
(8.9) 


In the language of differential geometry, infinitesimal symmetry transformations of 
tensor fields on G/H are given by the Lie derivative along the vector fields & 4. 
Combining (8.3) and (8.9) thus corresponds to 


Leo” = — fe Kho” — dk, Leo = — fE Kho (8.10) 


Finally, the exterior derivative of the MC 1-form is expressed by the MC equation 
dw^ = (1/2) f co? ^ œC. In terms of the wj and w, components of the MC form, 
this breaks down into 


do” = = L yg oP Ao’ += Tygo Aw 
(8.11) 


dof 


1 
TOET EET TEN 


8.1.2 Lagrangians with Two Spatial or Two Temporal Derivatives 


Invariance of the effective action under a symmetry group G requires that the 
Lagrangian (density) be quasi-invariant, that is invariant up to a surface term. 
Internal symmetry transformations, covered in this part of the book, do not 
involve any derivatives of NG fields. This implies that all the different parts 
LEP of the effective Pipe Me 1) must be quasi-invariant separately. The 
piece without any derivatives, we , must be strictly o invariant; its varia- 
tion u G oe an a derivative of any local operator. However, aoe 
Lez, Aa = 449 oy əx! = 0 leaves a constant, 7“-independent Zí eff 0:0) as the 
only option. The a expansion of the effective Lagrangian thus starts with 
operators with at least one pais Having a sole spatial derivative is forbidden 
by rotational invariance, hence go eff ) = 0 for any t > 0. The only options for (s, t) 


? It is in principle possible that the quien of Ler 00 under G is a feld, independent constant. 
This can however only happen if KA ) isa tadpole operator, linear in 2“. Such operators must be 
discarded for the EFT to be pertuthatively well-defined. 
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with s + ¢ < 2 that are left are therefore (0, 1), (0, 2) and (2, 0). The Lagrangian 
BA o D with one temporal derivative will be analyzed in Sect. 8.1.3. Here I will focus 
on ae and POA , which turn out to be much easier to understand and thus 
constitute a good starting point. 

Invariance under spacetime translations and spatial rotations restricts the two- 
derivative Lagrangians to the generic form 


1 1 
LOO = = 5 8ab(r) V1" - Vr? — aban rye dpnt Bsr” 


i (8.12) 
Leg? = Ea h” , 
where bap (7), &ap(7) and gap(7) are some functions on G/H. Note that the bap 
term can only exist in d = 2 spatial dimensions. Also, I have discarded from the 
outset operators containing a second derivative of 2“, since those can be brought to 
the form (8.12) by partial integration. 

The gap and gap terms must be strictly G-invariant. Indeed, should an infinitesi- 
mal transformation of, say, the gap term produce a surface term, it would have to be 
of the form V - [fa(m)V z1]. But that would inevitably contain a second derivative 
of x° that was not present in the original Lagrangian. This argument does not apply 
to the bap term, which is antisymmetric in spatial derivatives. However, in that case, 
possible quasi-invariance can be disregarded on physical grounds. Namely, the bap 
term contributes to the canonical Hamiltonian density, and its mere quasi-invariance 
would imply that the energy density of the EFT is ill-defined. All in all, both 2" 
and ae ) must be strictly invariant under the symmetry group G. 

Let us see what strict G-invariance tells us about the function gg,(z). The latter 
gives rise to a symmetric rank-2 tensor field on G/H, 


g(t) = gap (T )dT" Q dr? . (8.13) 


We know that the broken components of the MC form, w“(z), furnish a basis of the 
cotangent space (coframe) to G/H; see Sect. 7.4 for a detailed justification. In this 
basis, the tensor g(zr) can be expanded as 


g(t) = Kap (1) @" (1) Q w? (1) , (8.14) 


where the components kg,(a) are some as yet unknown functions on G/H. 
Using (8.10), we readily calculate the Lie derivative along & 4, 


£48 = (Le Kab — Keo Shak — Kac Ffpkh)o @ a? . (8.15) 
The requirement of strict G-invariance then leads to the following condition on Kab, 


OKab 
ome ` 


KE Féakeb + ffpKac) = £4 (8.16) 
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It is now convenient to resort to the exponential parameterization, 

U(x) = expr" Qa) , (8.17) 
in which &4 (r) = 64 + OC) and k4 (zr) = 64 + OC), cf. (7.41). By choosing the 
index A in (8.16) in turn as an unbroken and broken index, we thus get the following 


initial conditions at the origin of G/H, 


OKab 


fi hes + Fophaelag = 0 =0. (8.18) 


dae mz=0 


For a fixed broken index A, (8.16) can be viewed as an ordinary differential equation 
that uniquely determines Kap (7r ) along the integral curve of & 4, passing through the 
origin of G/H. We conclude that «gp must be constant, subject to the constraint 


faakeb + fabKac = 0. (8.19) 


This is equivalent to requiring that Kap is a constant invariant tensor under the adjoint 
action of the unbroken subgroup H. 

The same reasoning applies without change to any G-invariant operator con- 
structed solely out of the broken part of the MC form, w1. As a consequence, the 
most general form consistent with G-invariance that the functions bgp(77), gap (7) 
and gap(7 ) can take is 


bab t) = cdo (rad (xr) , 
(8.20) 
abl T) = Ked OSTO T), Babn) = Keos (tof (1) . 


The constant matrices Aap, Kab and Kap are all invariant tensors under the adjoint 
action of H. In the exponential parameterization (8.17), œp (7) = 6) + O(s) and 
thus ab = bap (0), Kab = ab (0) and Kab = Zab (0). 

For the kinetic term of the NG fields to have the correct signature, the symmetric 
matrices Kap and Kap should be positive-definite. Hence, both g(z) and its cousin 
g(x) constitute a G-invariant Riemannian metric on G/H. These two metrics are 
in principle independent from each other. However, in case G is compact and 
semisimple and all the broken generators Qa span a single irreducible multiplet of 
H, Schur’s lemma requires that kab, Kap be proportional to each other. The metrics 
g(z) and g(x) then coincide up to an overall factor. 


Suppose that the symmetry group G is completely broken. Then the matrices Kap, 
Kab, Aab remain unconstrained by symmetry, the only physical requirement being 
the positive-definiteness of kap and Kap. This, however, does not mean that when 
constructing an EFT for NG bosons on G/{e} ~ G, we have to work with arbitrary 
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matrices of couplings of rank dim G. The number of independent parameters can 
be reduced by a judicious choice of coordinates 2“ on G. In the theory of small 
oscillations of mechanical systems [3], it is known that two quadratic forms, at least 
one of which is positive-definite, can be simultaneously diagonalized. Thus, we can 
always assume that, say, Kab = dap and moreover that Kap is diagonal and positive- 
definite. We just have to keep in mind that this simplicity is a consequence of a 
particular choice of coordinates, not of symmetry. This may restrict the otherwise 
arbitrary choice of parameterization of the coset space. 


8.1.3 Lagrangians with One Temporal Derivative 


The part of the effective Lagrangian with one time derivative reads, generically, 


LOY = ca)" . (8.21) 
The functions cg(z) define locally a 1-form on G/H, c(z) = cq(a)dx%. The 
requirement of quasi-invariance of an D under the action of G is then equivalent to 


Ly c = dé, , (8.22) 


where C,(z) is a set of local functions (0-forms) on G/H. Using the fact that the 
Lie derivative commutes with the exterior derivative, one deduces that Lg, (dc) = 
d(Lg,c) = d(de,) = 0. Hence dc(zr) is a strictly G-invariant closed 2-form on 
G/H. Strict G-invariance requires that 


de(x) = oao (1) Aw? (mr), (8.23) 


where oap is a constant antisymmetric matrix, invariant under the adjoint action of 
H. This follows by the same argument I used above to derive (8.20). 

Not every H-invariant matrix oap will do, however, since the 2-form dc(z) must 
also be closed. Upon using (8.11), closedness of dc(zr) is seen to be equivalent to 


féach F JépOac = 0; fiz + 7. ena F fii ova =0. (8.24) 


The former of these conditions just reasserts the H-invariance of og». The latter 
constitutes a new constraint that we must now deal with. I will not be able to give a 
fully general aoe solution to the algebraic conditions on og», and thus a general 
solution for as We can get very far even without an explicit solution, though. 
First, note that (8.24) is a set of linear equations for oap. In a concrete case where 
the structure constants are known, the space of solutions to (8.24) can therefore 


be found using elementary linear algebra. Second, we can do even better and link 
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possible solutions for og» to the structure of the Lie groups G, H and their Lie 
algebras 9, b. 

In the parameterization (8.17), dc() = (1/2)[oap + Or) Jdr" ^ dz”. Thus, the 
corresponding Lagrangian can be power-expanded in the NG fields as 


1 
LE) = zoan don’ + O(n) . (8.25) 


Following the argument of Sect. 6.3.2, the matrix oap can be related to the set of 
vacuum expectation values (VEVs) of commutators of broken generators, 


aie ee (8.26) 
V>0o V 
There is a possibility that the representation of the symmetry group G on the 
Hilbert space of our system features a central extension. Let us write the extended 
commutation relation of symmetry generators as [Q4, Qg] = i on Qc +iVZzAB, 
where zag are the densities of the central charges. Then we find that 


7 E 
dab = — fap lim ~E — (zab) - (8.27) 


We conclude that possible solutions for oap directly reflect the Lie algebra of 
symmetry generators, including possible central extensions. One class of solutions 
to (8.24) that always exists is 

Oa =—fGoc suchthat foa =0. (8.28) 
The latter constraint expresses, once again, the invariance of the constant tensor 
oa under the adjoint action of H. This solution corresponds to vanishing (VEVs 
of) central charges, and o, can then be interpreted as the density of Q4 in the 
ground state. The H-invariance of o4 descends directly from the H-invariance of 
the ground state. For this class of solutions, (8.23) is readily integrated using the 
MC equation (8.11), leading to 


c(t) = -oo m) or LOY = -oof mi" , (8.29) 


up to addition of a closed 1-form (surface term). The variation of 2 under 
infinitesimal transformations from G can now be computed with the help of (8.10), 


Le,C = —oply, o” = op (fe kia + dF dk4) = d(ouk9) , (8.30) 
where in the last step I used that og fÈ = 0 by (8.28). The operators proportional 


to o, are strictly G-invariant. Mere quasi-invariance of the Lagrangian thus requires 
nonzero oy, that is nonzero VEV of an unbroken charge in the ground state. 
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In ferromagnets, G/H ~ SU(2)/U(1) ~ S?. Up to an overall factor, there is a 
unique closed and G-invariant 2-form on S°, provided by the volume (area) form. 
In terms of the unit-vector parameterization of the sphere, this is proportional to 
£i jen’ dnd A dnk , cf. Example A.7. The solution (8.28) descends from the nonzero 
magnetization in the ferromagnetic ground state, corresponding to a nonzero VEV 
of the generator of the unbroken U(1) subgroup. I will give an explicit form of the 
Lagrangian Ean D for ferromagnets in Sect. 9.2. 


In case the symmetry group G is semisimple, its Lie algebra g cannot have any 
nontrivial central charges (Sect. 2.7 of [1]). Equation (8.29) is then the only possible 
solution for the 1-form c(z) and the corresponding Lagrangian oo For G that 
are not semisimple, solutions with nonzero zgp may exist. 


Consider the coset space G/H ~ U(1) x UC)/f{e} = T?. In this case, the 
symmetry group is Abelian so the conditions (8.24) are trivially satisfied. Up to an 
overall factor, there is a unique rank-2 antisymmetric tensor, oap = €qp. According 
to (8.27), this necessarily arises from a central extension of the Lie algebra g 
of G. In the exponential parameterization (8.17), œf (m) = dz“, which leads to 
car) = (1/2)eqpa dr? . This is merely quasi-invariant under the action of G as 
expected. Note that the NG fields x“ are angular variables on the torus T? that 
are only locally defined, and so is therefore the 1-form c(z). What is well-defined 
globally on the whole coset space is only the G-invariant 2-form dc(zr). 


To conclude the analysis of the Lagrangian KA i let us see under what condi- 
tions we can expect it to actually shift by a surface term upon a G-transformation. 
Should Aw D be strictly G-invariant, it would necessarily assume the form 


LOY = -one (nx , (8.31) 


with a constant H -invariant o4, that is fe op = 0. This follows by imposing directly 
the constraint Lg , (Caw?) = 0. The corresponding closed 2-form is then 


1 
de = —oado" = -50a fE NO, (8.32) 


that is, ogy = — fo. Quasi-invariant Lagrangians a D are therefore in a one-to- 
one correspondence with antisymmetric matrices oap satisfying (8.24) that cannot 
be written as og, = — Í, Tc with some H -invariant tensor o,. Such antisymmetric 
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matrices span the so-called second Lie algebra cohomology of g relative to the 
subalgebra h. See Sect. 3 of [4] for a mild introduction and further references. 


In case the symmetry under G is completely spontaneously broken, only the second 
of the conditions in (8.24) survives. In this case, nontrivial solutions for oap, that 
is those that cannot be written as ogg = —f Tc with some oc, are in a one-to- 
one correspondence with the central charges Zap. Both of these are classified by the 
second Lie algebra cohomology of g. 


8.1.4 Overview of the Lowest-Order Effective Lagrangian 


Let me summarize what we have found so far. The basic assumptions I made are 
that an internal symmetry group G is spontaneously broken to its subgroup H, but 
the symmetry under continuous spacetime translations and spatial rotations remains 
intact. Then in d > 2 spatial dimensions, the parts of the effective Lagrangian with 
up to two derivatives of NG fields take the generic form 


0,1 z 
EAN = Ca(a)x? , 


1 1 
LEO = — Fk edo Top (V4 «Vr? — Zico (Teoh (a1 )e"* dpn dsn” , 


1- ee 
Log? = Reavy (og (eH? (8.33) 


Here kab and Kap are constant symmetric H -invariant matrices, 
Fike + ffpkac=0,  ffakcb + fac = 0. (8.34) 
Likewise, Agp is a constant antisymmetric matrix invariant under H, 
faarcb + faphac =0. (8.35) 


The Ag, term can only exist in d = 2 spatial dimensions. In the special case of 
a Lorentz-invariant system, both EA and the Ag, term in a are forbidden. 
Moreover, Kab = Kap under the convention that the speed of light is set to unity. The 


entire effective Lagrangian with up to two derivatives then shrinks to a single term, 
1 
LO = 5 Ked Oe r)a ður n” : (8.36) 


Both a and a are strictly G-invariant. On the other hand, woe 


È 
may be only quasi-invariant. The functions cg(z) on G/H are constrained by the 
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requirement that the 2-form dc = d(cgd“%) = (3cp/In Jdr" A dz? is closed and 
G-invariant. This implies 


de(z) = T saog mof dn" A dx? , (8.37) 
where Gap is a constant antisymmetric matrix subject to the conditions 
faaFed + fpFac= 0, fag Gea + fgcTad + feaFa = O- (8.38) 
Those Lagrangians or that are strictly G-invariant assume the form 


LOY = —opwbr)x*, where f? oy =0. (8.39) 
In this case, cg(7) = —opw? (7) and ogp = — fic: In general, the 1-form c(z) 
is determined by (8.37), hence by oap, up to addition of a closed 1-form, c(z7) > 
c(a) + de(zr). Such a shift only changes the Lagrangian by a total time derivative, 


OCT) . 


dc(zr) 
(0,1) 
apa t= Log t+ 


dt ° 


ZOU Pe” 4. (8.40) 
and so does not affect the dynamics of the low-energy EFT. 

Let me conclude the overview with some remarks on the topological aspects of 
the effective Lagrangian. First of all, the broken part of the MC form w1 is globally 
well-defined on the coset space G/H. This is because it furnishes a coframe, dual 
to the frame built out of the vector fields that realize the action of G on G/H. 
All the parts of the effective Lagrangian (8.33) that are strictly invariant under G 
are constructed out of products of œf (x) with constant tensor coefficients. As a 
consequence, all these parts are themselves globally well-defined on G/H, even 
though their explicit form may depend on the local coordinates x“. In particular the 
Kab, Kab terms correspond to G-invariant Riemannian metrics on G/H. 

The global existence of 2" , on the other hand, is not guaranteed by 
our construction. Problems may arise only if ay is merely quasi-invariant, 
otherwise the same argument as that for a and Zo applies. Quasi-invariant 
Lagrangians a are classified by the second Lie algebra cohomology of g 
relative to the subalgebra 5. Should the group G be compact and connected and 
the subgroup H closed and connected, this relative Lie algebra cohomology is 
isomorphic to the second de Rham cohomology of G/H (Theorem 7.4 in [5]). 
Cohomologically nontrivial oap then ensures that the 2-form dc is closed but not 
exact. As a consequence, the 1-form c(z) cannot be extended from the local 
coordinate patch to the entire coset space. 
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Example 8.5 


Let us contrast the coset spaces G/H ~ G/{e} with respectively G ~ U(1) x UC) 
and G ~ Rx R. These have identical Lie algebras g, h that possess a single generator 
of the second (relative) Lie algebra cohomology, og» = £ap. The single-derivative 
Lagrangian ER = (1/ DVYeapn 4 don” is quasi-invariant under G in both cases. 

Now in the first case, G/H ~ U(1) x U(1) ~ T?. The 2-form dc(z) is 
proportional to the volume form on the torus, and constitutes the single generator of 
the second de Rham cohomology of T?. This agrees with the fact that G is compact 
and connected and thus the two cohomology groups are necessarily isomorphic. The 
Lagrangian Aas is obviously not globally well-defined on T? because the fields 
x’ are not. 

In the second case, on the other hand, G/H ~ R?. In the Euclidean plane, the 
Poincaré lemma holds and the second de Rham cohomology is trivial. Accordingly, 
A a is globally well-defined, since x are now nothing but the two Cartesian 
coordinates in the plane. 


A pedant might object that a Lagrangian that is not well-defined everywhere 
on the coset space renders the whole EFT ill-defined. However, in classical 
field theory, one is largely interested just in the EoM. The latter only involves 
the functions cg(z) through the combinations dcp/dm° — d0cq/ dz, that is, 
only depends on dc(z). Unlike the Lagrangian, the classical EoM is therefore 
globally well-defined on G/H. 

The situation is quite different in quantum theory where we need to be 
able to perform a functional integral over all field configurations on the coset 
space. Luckily, we do not really need the Lagrangian eftir], or even the 
action Sef, to be well-defined. What should be unambiguous is just the phase 
factor exp(iSefp). To ensure this, one needs to cover the coset space with a 
set of coordinate patches and define the action by piecewise integration. A 
detailed analysis is well-beyond the scope of this book. A curious reader 
will find more details and further references in [6], which puts forward a 
modern classification of topological terms in the action using a homology- 
based approach. Here I will just state without proof that consistency and 
G-invariance require that the so-called Manton condition be satisfied: the 1- 
forms tg, (dc) must be exact for all the vector fields § 4 realizing the action of 
G on G/H. This is a stronger condition than G-invariance of dc that I used 
above, which implies merely that l 7 (dc) is closed. The Manton condition is 
automatically satisfied by our AN whenever the first de Rham cohomology 
of G/H is trivial. This is the case for instance when G is compact and simply 
connected and H is connected; cf. Example A.28. 
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8.2 Effective Lagrangians from Background Gauge Invariance 


I this section I will switch gears and outline another approach to the construction 
of effective Lagrangians for SSB. This is based on a technical assumption that rules 
out some of the more exotic theories covered by the method of Sect. 8.1, notably 
systems with a central extension of the symmetry algebra. The reward for making 
this sacrifice is a drastic simplification of the classification of possible contributions 
to the effective action. All we shall need will be elementary field theory with no 
recourse to differential geometry. The method presented here was pioneered by 
Leutwyler [7,8], and I will largely follow the pedagogical account of [9]. 

The generating functional formalism constitutes an arsenal of important tools 
in both classical and quantum field theory. In this framework, one couples a 
given theory to a set of classical external (or background) fields and subsequently 
integrates, if only formally, over the dynamical degrees of freedom. The physical 
properties of the theory are encoded in the ensuing generating functional of the 
background fields. See Chap. 16 of [10] for an introduction to the formalism. 

In principle, one has the freedom to choose the background at will. However, in 
presence of a continuous internal symmetry group G, it is convenient to introduce 
a set of background gauge fields A4, one for each generator Q4 of G. These can 
be clustered into a matrix-valued gauge field A, = Ai Q a. The important technical 
assumption I am making here is that the symmetry under G is gaugeable. That is, 
it is possible to add the gauge fields so that the generating functional W{A} of the 
system? is invariant under the gauge transformation 


Ay > TA = 8Apg! +igðpg! . (8.41) 


Example 8.6 


Consider a theory of a set of (not necessarily scalar) fields ø’, whose Lagrangian 
density -Z [¢] is strictly invariant under a linear representation of the symmetry 
group G, ¢! = R(g)' „pÏ. Such a Lagrangian can always be made invariant under 
the simultaneous local ‘transformation of ¢! and (8.41). All one has to do is to replace 
derivatives of øt with gauge-covariant derivatives, 


dup > [biðu —iR(Ay)' Jo! - (8.42) 


In order to assert gauge invariance of the generating functional W{A}, one still 
needs to check that the functional integral measure does not change upon the local 
transformation of ¢’. That would indicate an anomaly. In general, the assumption of 


3] follow [7,8] and use braces to indicate the arguments of a functional. Square brackets are 
reserved for local functions of fields and their derivatives. 
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gauge invariance of W{A} essentially amounts to the absence of anomalies or other 
obstructions to gauging. 


A linear realization of symmetry is typical for microscopic field theory models. 
There, one can usually check explicitly that the assumption of gauge invariance of 
the generating functional is satisfied. The low-energy EFT for the NG bosons should 
then, when coupled to the same external fields, reproduce the same generating func- 
tional. In other words, we expect that the effective action Ser{z} = f dPx Lest | 
can be replaced with a “gauged” action Seg{z, A} = f dPx Le¢{, A] such that 


el WIA} — J Dr exp[iSer{, A}] . (8.43) 


Leutwyler [7] showed that gauge invariance of W{A} is sufficient to ensure that 
the effective action Se¢{2, A} is invariant under the simultaneous local transfor- 
mations (8.2) and (8.41) of the NG and gauge fields. The locality of (8.2) is 
implemented by allowing g therein to depend arbitrarily on spacetime coordinates 
as in (8.41). 


It is common to gloss over this detail and simply assume right away that 
the effective action is G-invariant. I did the same in Sect. 8.1. It is therefore 
worth stressing that this is a consequence of the properties of the generating 
functional, established at the microscopic level. The proof is technical and I 
refer the reader to [7] for details. 


We have replaced the problem of classifying possible G-invariant actions Se¢¢{z} 
with that of constructing locally G-invariant actions Serr{7, A}. How does that make 
our task easier? In short, a lot. I will now show in a series of simple steps how 
the problem of finding all possible gauged actions Serr{7, A} can be reduced to an 
exercise in group theory. If desired, one can then always discard the background 
gauge fields and thus recover (most of) the results of Sect.8.1. But the gauged 
action in fact carries more information, as it tells us how the EFT responds to certain 
external perturbations. 


8.2.1 Methodology of Construction of Effective Actions 


In the first step, we exploit the gauge invariance to eliminate explicit dependence of 
the action on the NG fields. Namely, by setting g = U()~!, we get 


Serr{z, A} = Sers{0, Ty ¢-1 A} - (8.44) 
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By combining (8.2) and (8.41), we find that under the local action of G, the 
composite field Ty r)-! A, transforms as 


& 
Tuay Ap > Tying)! Tg Ap = Tha, g) Tua) Au - (8.45) 


This is a special case of a local transformation of Ty (x)-! A, from H. Thus, gauged 
actions Seff{7r, A} locally invariant under G are in a one-to-one correspondence with 
locally H-invariant functionals of Ty (7)-1 Ap. 

Let us have a closer look at this composite field. By (8.41), 


Tya) Ap = U) ALU) + Ut)" ,U Or) nae 
= iU (x) | (ð, — iA )U (1) . 


Up to an overall sign, this is just a gauged version of the MC form (8.5). We can 
make the analogy explicit by turning the gauge field into a 1-form, A = A dx“, 
and defining the gauged MC form viat 


Q (r, A) = Q4 (r, A)Q adx" = -Tya A = —iU (x) (d — iA)U (1) . 
(8.47) 


This g-valued 1-form can again be split into unbroken and broken parts, 2 = Q + 
Qı. These inherit the transformation rules (8.9) under the local action of G, 


Qy (re, A) > QG, AY) = hC, g)Qj (x, Ahr, 8)! — ihr, g)dh Cr, g)", 
Qi (ot, A) > Q1, A) = h(a, 8) 0r, AAC, g)! . (8.48) 


The second step is to realize that we can temporarily forget about the origin 
of Q), Q1 in terms of the NG fields x° and gauge fields A. For the purposes of 
constructing the effective action, all we need is that — (2) transforms as a gauge field 
of H, whereas Q transforms linearly under the adjoint action of H. Once a locally 
H-invariant action Segf{S2y, 21} has been found, we can reconstruct the dependence 
on 7%, AT using (8.47). Now define two new vector fields by taking a variation of 
the action with respect to 2 and 21, 


bSefe{Q2y, 21} 
Sag 


Sef {2 21} 


JEQQ, 21] = 
æ l2 Q1] 524 


; E i QL] = (8.49) 


4 This is a slight abuse of notation. The gauge field A is a 1-form on the spacetime. In order for 
Q (zr, A) to be a well-defined 1-form on the spacetime as well, the MC form œ (7x ) has to be pulled 
back by the map x° : x > z" (x). 
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Fig. 8.1 Integration path in 
the space of parameters u, v 
(shown by the oriented solid 
line), used to reconstruct the 
effective action from its 
partial derivatives (8.50) 


In spite of the different transformation properties of (2) and (2, , both J& and =F 
transform linearly under the adjoint action of H. This is just an auxiliary statement, 
and I therefore refer the reader to Appendix C of [9] for a detailed proof. The main 
message is that to construct such covariantly transforming objects is straightforward, 
and once we have done so, we can reconstruct the action. 

To that end, introduce real scaling parameters u, v. It follows from (8.49) that 


Qy, vQ 
PSU VO) L f 4P, ocx uy, vO 10) 
u 
E (8.50) 
A f dP 24 EL UNY, VAM) . 
U 


Setting without loss of generality Seff{0, 0} = 0, the full action is then obtained by 
integration along the path in the u v space, shown in Fig. 8.1, 


1 
Setr{Qy, QL} = f ax f du Q% (x) JE uR, 0) 
i (8.51) 


1 
+ forf dv 24 (x) EK [Qy, vALI(x) . 


Equivalently, the effective Lagrangian -Zepr[Qy, Q1] can be split into two pieces, 
Ll, 21] = Les[Qy] + Anv[Qy, 21], such that 


1 
Les[Qy] = i) du On Ig UR, 0], 
: (8.52) 
Linv[Qy, Q1] = f dv QF EFQ, v2]. 
0 


As a consequence of the linear transformation properties of Q4 and E}, the part 
Zin (Quy, Q1] is strictly invariant under the local action of G. Possibly quasi- 
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invariant contributions to the effective Lagrangian reside in the part Ycs[Q)], which 
only depends on the h-valued gauge field — 92]. I will refer to this part as Chern— 
Simons (CS) due to its resemblance of the CS theory with gauge group H. 

The strategy to reconstruct the effective Lagrangian is now as follows. The 
gauge-invariant part of the Lagrangian, -Zipy[Q), 21], can be built directly out 
of Qt the field-strength of —Q*, Giv = —d,Q5 + dy 2i + fiy of QY, and their 
covariant derivatives. This is common lore but if desired, a detailed proof can be 
found in Appendix D of [9]. In the same way, one can construct possible gauge- 
covariant “currents” Jf’, except that now the available building blocks are only 
Gv, and its covariant derivatives. In both cases, products of the covariant building 
blocks must be contracted with tensor coefficients that ensure invariance under the 
residual linearly realized symmetries (spatial rotations and the unbroken subgroup 
H). Finally, the CS part of the Lagrangian is obtained from the current Jf’ by 
integration over u as indicated in (8.52). Here the quasi-invariance under H must be 
checked explicitly and may impose further constraints on the tensor couplings. 

Up to any desired order in the derivative expansion, only a finite number of 
covariant operators can contribute to Liny[Qy, Q1] or Je [Qq], and they can be 
enumerated by inspection. See [9] for a full list up to order four in derivatives. The 
problem of constructing all gauge-invariant actions then boils down to solving the 
group-theoretic linear constraints on the tensor couplings of the individual operators. 
The full dependence on the fields 74, A is already completely fixed by the structure 
of the operators. The practical use of this algorithmic procedure is best illustrated 
on the sample analysis of effective Lagrangians with up to two derivatives, worked 
out below. 


8.2.2 Lagrangians Up to Order Two in Derivative Expansion 


For the derivative expansion to be consistent, the Lorentz index on Ail has to count 
equally to the derivative 0/dx". Each of 2%, Qi, is then of order one in (spatial or 
temporal) derivatives. Up to order two, the invariant Lagrangian Zjny[Qy, Q1] may 


contain the following contributions, 


i (order 1), i Q?, DQ, Gis (order 2) , (8.53) 
where Du Q$ = 3u Q$ + ilu, QLy]*. However, a G-invariant Lagrangian density 
built out of D, Q$ alone would be a bans surface term. Similarly, a term of the type 
Ca Giv would boil down to cy Tins ty the rest of Giv being a surface term. But 
the combination cy f By would necessarily vanish due to the H-invariance condition 
imposed on the coefficient cy. Altogether, upon adding the appropriate tensor 
couplings, the most general gauge-invariant and rotationally invariant Lagrangian 
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up to second order in derivatives is 
a l1- aob 1 rso4ob 1 rsoaob 
Ziny[Qy, Q1] = — oa R0 + 5 Kab &2 2 = 5 Kavd QF Qs — a hav€ QQ: . 
(8.54) 


From (8.48), the coefficients og, Kab, Kab, Aab must be invariant under the adjoint 
action of H, that is satisfy the constraints 


flab =0, fkadcb + fopdac =0, 


foakeb EE Fapkac =0, foakcb + FopKac =0. 


(8.55) 


In order to pin down -Zcs[Q|] up to second order in derivatives, we need Je [2] 
up to order one. But the basic building block available, G% ,„, already has order 
two. The only possibility left therefore is a constant current, restricted by rotation 


invariance to JEL] = —56 Ou. By (8.52), this leads in turn to 
Les[Qy] = -0a 20 (8.56) 
as the sole possibility. Quasi-invariance under H imposes a constraint on oy, 
Sagay =0. (8.57) 


Note how effortlessly we recovered the effective Lagrangian, obtained in 
Sect. 8.1 and summarized in (8.33). The only difference is that out of all the 
possible solutions for the ca (zr) functions, those found here correspond to vanishing 
VEV of the central charges Zap in (8.27). This class of solutions is selected by the 
requirement that the symmetry under G is gaugeable. 

However, our newly deduced effective Lagrangian consisting of (8.54) and (8.56) 
does not just reproduce the previously found results. It also tells us how the NG 
fields couple to the background gauge fields A i: To see how the Lagrangian depends 
on the latter, we combine the definition (8.47) with (8.7) and (8.8) to write 


Qi (r, A) = w (T) — Avi (1) Oa = wf (m )[dr’ — A4E! (t)]Qa. (8.58) 


This makes perfect sense: the vector fields £ 4 define infinitesimal group motions on 
the coset space. As a consequence, 


Dun? = dyn" — AGES (T) (8.59) 


is the correct definition of a derivative of the NG field, covariant under local 
G-transformations. Altogether, the effective Lagrangian up to second order in 
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derivatives can then be written as 


LOY = ogwr" + cave nyAe , 
(2,0) 1 c d a b 1 c d rs a b 
Lf = — z edw (1), (1) Dr - Dx — z^cawgCOr)wp (rE D, z! Dyr 
1 : 
a 2) = zkeatvs (ars (1) Dow" Dor” f (8.60) 
Example 8.7 


A chemical potential describing a statistical many-body state of a system can be 
introduced into its Fograngiah ” a constant temporal gauge field (see Chap. 2 
of [11]). Replacing Aj —> bn0 u^ in (8.60) therefore allows us to analyze the effect 
on the EFT of a set oft chemical potentials u^. In particular, one can thus determine 
the ground state triggered by the chemical potentials as well as the spectrum of NG 
modes above it. A detailed discussion can be found in [12], where this setup was 
used to pin down the spectrum of massive NG bosons, introduced in Sect. 6.4.3. 

In spin systems, the chemical potentials u^ for the generators of G ~ SU(2) can 
be interpreted as the components of an external magnetic field B. This is because 
the magnetic field couples to the conserved charge of SU(2): spin. P the help 
of Example 7.14, one can easily check that the second term in Z te in (8.60) is 
proportional to B -n. This is the Zeeman interaction of spins with the magnetic field. 
Its normalization is fixed by o4, hence by the magnetization of the ground state. 


8.2.3 Effects of Explicit Symmetry Breaking 


So far in this chapter, I have assumed the symmetry of the EFT to be perfect. 
There are however good reasons to consider the effects of (presumably small) 
perturbations breaking the symmetry explicitly. First, I already used this idea in 
Sect. 5.2 to isolate a unique ground state in presence of SSB. Second, symmetries of 
real physical systems are almost always just approximate, even if to a high precision. 

I will not attempt a general analysis of explicit symmetry breaking in the 
EFT, but rather focus on a special case that is commonplace and admits a very 
simple treatment. Suppose that the microscopic Lagrangian of the system contains 
a contribution moO? where mọ are real constant parameters and O° a set of local 
operators. Suppose these operators transform under some (real) linear representation 
R of G, 


0? 5 OP =R(g)?,0°, geG. (8.61) 
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Such a perturbation violates the invariance of the action under G unless mg = 0 for 
all the components O° that belong to a nontrivial irreducible representation of G. 
In order to understand how the effects of the perturbation propagate to the 
low-energy EFT, it is convenient to promote the parameters mọ to background 
fields. The generating functional W{A, m} then remains exactly invariant under the 
simultaneous local transformation 
Au > TeAu = gAug | +igdug |, mo > m, =R(g7!)? mo. (8.62) 
g g 
This translates to the invariance of the gauged effective action Seg{z, A, m} under 
a simultaneous transformation of 74, Ai, Mo, generalizing (8.44) to 


Setet, A, m} = Sepe{O, Ty q)-1 A, MR(U (7))} . (8.63) 


The action therefore depends on x“, Ai and mọ only through two composite fields, 
Ty)! Áu and So (7, m) = RU (T) Mmo. The field Eg (7, m) transforms under 
G through a linear representation of H, 


Bo(,m) È So(n',m’) = RU (w')) gm’, = RU (1) RA, g) 1)” gz 


c= 


= R(h(x, g)')" Eo (a, m) . (8.64) 


By extension of the argument in Sect. 8.2.1, we can localize & in the strictly 
invariant part of the Lagrangian. The complete Lagrangian then takes the form 


Lett, A, m] = Les[Qy] + Fny[Qy, Q1, E]. (8.65) 


Exactly which operators containing Eg should be included at a given order of 
the derivative expansion depends on how we decide to count Eg. It does not contain 
any derivatives of NG fields but, being proportional to the perturbations mọ, it makes 
sense to treat it as small. Depending on the concrete system, only a finite number of 
operators including Eg is then needed at any fixed order of the derivative expansion. 
The leading perturbation of the effective Lagrangian is linear in Ep, 


perl, m] = n° Bo (7, m) . (8.66) 


The effective coupling 7° must be H-invariant, which amounts to R(Qa)fo n? = 0 
for any generator Qa of H. A more complete list of operators containing E, and 
contributing to the effective Lagrangian can be found in [9]. 


Example 8.8 


The choice of background fields and their transformation properties is up to us. To 
illustrate this freedom, note that it is possible to treat the gauge fields A‘ themselves 
as linear perturbations. In this case, the operators O? are the Noether currents J A 
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of the symmetry group G. These transform according to the dual of the adjoint 
representation of G, 


JE S Pa Rg) BT, (8.67) 


where & is the adjoint representation itself. The gauge fields Ail are now introduced 
to the microscopic Lagrangian through the linear cous Aa Ja K. Equation (8.62) 
assigns them the transformation rule At 4 R(g)4 5 AB which amigas to discard- 
ing the derivative piece of the palison gauge ransfonmation (8.41). Follow- 
ing (8.63) then leads to the composite field 


BA (mr, A) = RU (m) H g AB = vg AR . (8.68) 


The leading contribution of such a perturbation to the effective Lagrangian is then 
nav (m)A. The coupling 74 must be H-invariant, that is fêna =0. 

We have successfully recovered the structure of the second term in Z oo 
in (8.60). Note that treating A‘ as a linear perturbation leaves the effective ae 
na unfixed. One could in principle apply the same reasoning case by case to 
other operators in the effective Lagrangian containing A However, imposing 
full background gauge invariance is obviously much more efficient, as it fixes the 
dependence of the EFT on the gauge fields entirely without any free parameters. 


8.2.4 Coupling to Matter Fields 


Throughout the whole book, I mostly assume that the NG bosons are the only 
low-energy degrees of freedom present in the given system. This is justified in 
case any other, non-NG modes in the spectrum possess a gap. The validity of 
the low-energy EFT for NG bosons is then limited to energies well below this 
gap. However, there are physical systems where strictly gapless non-NG degrees 
of freedom naturally occur. One generic possibility is that the spectrum includes 
a Fermi sea of particles such as electrons, protons, neutrons, or quarks. Any local 
low-energy EFT must then necessarily include such additional gapless degrees of 
freedom. It is therefore worthwhile to digress and see how such modes fit into the 
EFT framework developed in this chapter. 

The question how non-NG fields (also called matter fields) transform under the 
nonlinearly realized symmetry was already resolved in Chap. 7. Namely, any set of 
(not necessarily scalar) matter fields x? can without loss of generality be assumed 
to transform under some linear representation D of the unbroken subgroup H. The 
action of the whole group G is then given by 


x2 Š x(x. mg) = D(C, 8))25X° , (8.69) 


alongside (8.2) which defines the matrix h(x, g) € H. 
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We are now looking for the most general effective action Seg{az, A, m, x} 
consistent with the background gauge invariance under G. Our basic trick has 
been to eliminate explicit dependence on the NG fields by performing a gauge 
transformation with g = U ()~!. For this transformation, h(z,U()~!) = e, 
hence the matter fields x} remain unaffected. The effective action can therefore be 
built out of the composite fields Ty (,)-1 Ay, and Sg (zr, m), and x°. The rest is just 
group theory. Since the matter fields may also enter the Lagrangian with derivatives, 
it is useful to have at hand their covariant derivative, 


Dax? = ðu x! + IQD (Qa) o x” : (8.70) 


The transformation rule (8.69) is not necessarily the only possible choice 
for the action of G on matter fields. Suppose that the (presumably 
reducible) representation D of H can be extended to a linear representation 
of the whole group G on the same set of fields x°. The redefinition 
xe —> V? = D(U(m))®,x° then gives variables that transform linearly 
under the whole G, Y? > D(g)°o Y7. Such fields, while superficially nat- 
ural, however conceal the physical structure of the spectrum. The degenerate 
energy levels are still organized in multiplets of the unbroken subgroup H. 
States from different multiplets of H, even if formally belonging to the 
same multiplet of G, will have different dispersion relations, and different 
interactions. In the extreme case, it may not even be possible to form complete 
multiplets of G. The arguably more complicated nonlinear transformation 
rule (8.69) is then the only option. An explicit example will illustrate this 
best. 


Example 8.9 


Most natural ferromagnets are metals, which betrays the presence of gapless, 
conducting electrons. While their appearance in spin doublets would seem natural, 
this is no longer mandatory once the G ~ SU(2) spin symmetry is spontaneously 
broken down to H ~ U(1). Let us denote the two possible polarizations of the 
electron as up (fî) and down (|). Each of these carries a one-dimensional complex 
representation of H. The action of the full spin group G is then defined in accord 
with (8.69), 


Xp > Dyhl, gxr, XY > Dy (h(a, 8) xy - (8.71) 
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In the effective Lagrangian, Schrddinger-type terms for the up and down electrons 
can be added independently from each other, 


. D? 
La > cox) (ino +=— + no) Xot. (8.72) 
an 2mo 


Here mo and uo are the effective mass and chemical potential of Xo , and the ellipsis 
denotes other possible operators, compatible with all the assumed symmetries. The 
covariant derivatives induce coupling between the electrons and the magnon (spin 
wave) degrees of freedom. There is nothing that prevents us from setting one of c+, 
c, to zero: we do not need both spin polarizations to preserve the invariance under 
SU(2). Physically, this amounts to the possibility of magnons interacting with a 
fully polarized Fermi sea of electrons or other spin-1/2 fermions such as neutrons. 


Another important application of the formalism for coupling matter fields to NG 
bosons is the description of interactions between baryons and pseudoscalar mesons. 
The latter are pseudo-NG bosons a of spontaneously broken symmetry of quantum 
chromodynamics. Baryons play the role of matter fields that are much heavier than 
the NG modes, but may nevertheless be added to the low-energy EFT if desired. An 
interested reader will find more details in Chap. 4 of [13]. 


8.3 Equation of Motion 


For many applications, it is useful to have at hand the EoM for the NG bosons. 
This is of obvious value in classical physics, it is however also very useful in 
quantum field theory. First, the linearized EoM can be used to identify the spectrum 
of excitations above the ground state and their perturbative propagators. Second, 
within a derivative expansion of the EFT, knowing the leading-order EoM helps to 
eliminate redundancies in the effective Lagrangian at higher orders. 

Let us return to the general effective Lagrangian (8.33) without background 
gauge fields, this time augmented with the perturbation (8.66). Deriving the EoM 
from the effective action amounts in principle to a mere variation with respect to 
the NG field 7“. However, the complicated dependence of the Lagrangian on m° 
makes this a rather odious task. I will therefore make a rare exception and leave out 
all details, simply displaying the final result. A reader wishing to verify it should be 
prepared to a repeated use of the MC equation (8.11) and of the algebraic constraints 
on all the effective couplings. 

I will use the shorthand notation w^ (2) = os (x)ð T". Also, I will need the 
covariant derivative of the broken part of the MC form, 


D of (a) = apol) — ffo (70 eo} (a0) . (8.73) 
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With these preparations, the EoM descending from the lowest-order effective 
Lagrangian (8.33) can be written as 


b = b d= b S b d b 
Oabw) — (Kab Dow = FinKcd po) F (Kapd'* Do, = ATO ws) (8.74) 


1 , 
+ a fihea + fhad + fla) oP os + myR(U T RGQa) n= 0. 


This does not appear particularly elegant, but it is not that bad. In concrete 
applications, one or more of the terms in (8.74) are often missing. The last term 
obviously vanishes in the limit of exact symmetry. The term proportional to e”* is 
absent in d 4 2 spatial dimensions. Moreover, all the terms containing a structure 
constant with three broken indices, ff, or similar, drop for symmetric coset spaces 
(see Sect. 7.3.2). One last special case, in which the form of (8.74) drastically 
simplifies, deserves spelling out explicitly. 


Example 8.10 


Consider the class of EFTs for Lorentz-invariant systems. This amounts to setting 
Kab = Kap as well as to dropping the single-time-derivative term and the purely 
spatial two-dimensional term proportional to e”*, 

b d L ; 

Kab8”” Dupoy — fapKed8"” 0p — MRU (T) RQ) n =0. (8.75) 
Under the additional assumption that the coset space G/H is symmetric, which is 
the case for many relevant physical systems, the EoM takes the extremely compact 
form Kapg"” Do? -Mo R(U (T) o RGÖQa) ¢ n7 = 0. This is essentially a nonlinear 
generalization of the Klein—Gordon equation to symmetric coset spaces. 


A couple of remarks on the main result (8.74) are due. First, the last term 
proportional to mg looks like it might contribute a nonzero constant in the limit 
mz” — 0. That would indicate an instability of the origin of the coset space under 
the symmetry-breaking perturbation. This is hardly surprising; in presence of the 
perturbation, we can no longer expect all the points of the coset space to correspond 
to physically equivalent vacua. In order to ensure that 7“ = 0 is at least a stationary 
point of the potential induced by the perturbation, one has to demand that such a 
constant term in (8.74) is absent. That amounts to the condition 


mpR(Qa)?gn” =0. (8.76) 


Second, the contribution of the two-dimensional term proportional to e’* van- 
ishes if ff Aca + ffad + f4Ana = 0. This implies by means of (8.24) that the 
2-form (1/2)Agpw" (1) A o? (zr) is closed, hence contributes a mere surface term to 
ae. This makes sense, since surface terms do not affect the EoM. 
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What if we want to know how the EoM depends on the external gauge fields 
AD? It might appear we could obtain a gauge-invariant EoM by replacing 
everywhere in (8.74) Of and On with their gauged counterparts (24 , On Yet, 
there are subtleties hidden in this naive guess. First, we know from Sects. 8.1 
and 8.2 that there are admissible values of og, for which the symmetry 
under G cannot be gauged. The catch is that not every modification of (8.74) 
renders it a well-defined variational equation. That is, it may not be possible 
to construct a Lagrangian bottom-up from the gauged EoM. It is not difficult 
to show that consistency of the gauged EoM and background gauge invariance 
of the corresponding action require that oap satisfies the constraints (8.28). 

Second, it is not obvious that even when the symmetry under G can be 
gauged, the minimal replacement of œw; and œ, with Q7, and 9%, gives the 
correct gauged EoM. Repeating the derivation of the EoM but this time 
starting with the gauged Lagrangian (8.60), one finds instead of (8.74) the 
following, 


= fipa Q — (Kab DON — Sipke 242%) + (kab D, QF 
1 
= fapkodð GUY) + 5 (Sapca + førhad + fegàba) YA (8.77) 


= shane! [UC FesU CO) +M RU (T) RUQa) n° = 0. 

Here Fy» = 0,Ay — Ap — i[Ay,, Av] is the field-strength tensor of 
A,. The slight modification of the first term agrees with the fact that the 
symmetry under G can only be gauged if oap = — fS oc. Most importantly, 
however, there is an extra term that could not have been guessed by the 
naive gauging of (8.74). This has interesting consequences. Even when the 
2-form (1/2)Agpw" (T) A œ? (x) is closed, thus contributing a mere surface 
term, gauging it leads to a nontrivial modification of the Lagrangian as well 
as the EoM. 


8.3.1 Spectrum of Nambu-Goldstone Bosons Revisited 


It is common to analyze the excitation spectrum by expanding the Lagrangian to 
second order in fluctuations around the ground state. Here I will take an alternative 
route to illustrate the utility of the EoM. Namely, I will linearize (8.74), that is 
expand it to first order in the NG fields 2“. For the sake of simplicity, I will discard 
all external fields including the perturbations mg. By invoking the exponential 
parameterization (8.17) in which œ; (0) = 67, we get at once 


Gabh? — Kab? + Kap Vn? XO. (8.78) 
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The © symbol reminds us of the linearization done. This has plane-wave solutions 


mw (x,t) = RleE eles (8.79) 
where the amplitude 7“, energy E and momentum p satisfy 
—iEo gpk? + E? Kap? — p kap? =0. (8.80) 


We would now like to understand how the dispersion relations of the various 
NG modes are related to the matrices Kap, Kap and oap. From Sect. 6.3.2, we expect 
to find type-A and type-B NG bosons. I even declared therein that the dispersion 
of type-A NG modes is linear in momentum, whereas that of type-B NG modes 
is quadratic. We are now in the position to justify this claim. First, one can always 
change the basis of the variables 7“ so that kap = ĉap. Moreover, ogy can be brought 
by an additional orthogonal transformation to a block-diagonal form, 


0 0712 
021 O 


Oab = 


(8.81) 


The upper-left block & corresponds to the type-B sector; these are the fields that 
enter the bilinear part of ae . The lower-right block will analogously correspond 
to the type-A sector. The numbers of the different types of modes agree with our 
previous counting rule (6.24). 

One last simplification we can make is to diagonalize the lower-right part of 
Kab by yet another orthogonal transformation without spoiling the already reduced 
forms of Kap and ogy. Thus, Kap can be assumed to take the generic form 


, (8.82) 


Kab = 
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where A is symmetric and A is positive-definite and diagonal. Altogether, the 
spectrum of NG bosons is obtained by imposing the condition that the determinant 
of the matrix of coefficients in (8.80) vanishes, 


_ 24. 22 2 
if a ee) 2h, (8.83) 
E?B |E7A — p 


The exact solution E (p) for the dispersion relation of the various modes will be very 
complicated. However, the asymptotic behavior of the dispersion in the limit of low 
momentum is easy to extract from (8.83). Namely, in this limit, the contribution 
of the off-diagonal blocks can be neglected, as can be the E7A term in the upper- 
left block. The asymptotic dispersions in the type-A and type-B sectors therefore 
descend from the already diagonalized matrix equations 


E?A— p> 0. (type-A), iEX + p?~0 (type-B) . (8.84) 


The presence of both first and second time derivatives in the type-B sector 
indicates that (8.83) has solutions E(p) with nonzero limit as p —> 0. 
Taken at face value, the linearized EoM (8.78) thus predicts the existence 
of gapped modes, accompanying type-B NG states. The required balance 
of operators with one and two time derivatives may however violate the 
derivative expansion of the EFT. In general, the presence of such gapped 
partners of type-B NG bosons cannot be asserted from the symmetry-breaking 
pattern alone. 


8.3.2 More on the Geometry of the Coset Space 


We had a first look at homogeneous spaces from the point of view of differential 
geometry in Sect. 7.4. I showed that any coset space G/H possesses a collection 
of G-invariant (pseudo-)Riemannian metrics. In this chapter, we found a use for 
them: two such metrics, g(x) and g(x), enter the part of the effective Lagrangian 
for NG bosons with two derivatives. In Sect. 7.4, I also introduced a class of affine 
connections on coset spaces. I have not made use of the ensuing curvature and 
torsion so far. However, we will see in Chap. 10 that these appear naturally in 
scattering amplitudes of NG bosons. 

What I want to briefly discuss now is yet another geometric structure on 
homogeneous spaces that is intimately connected to the spectrum of NG bosons. The 
reader might, if needed, want to recall the contents of Sect. 4.3.1 before proceeding. 
The exposition below follows [2], to which the reader is referred for a much more 
thorough discussion of geometric and topological aspects of type-B NG bosons. 
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Suppose that the spectrum of NG bosons were purely type-B as in ferromagnets. 
The matrix oap must then be nonsingular. The same applies to the 2-form dc(zr) that 
defines the part of the effective Lagrangian with one time derivative, 2. Being 
simultaneously closed, this 2-form therefore establishes a symplectic structure on 
the coset space. The 1-form c(z) is the eotresponciar he eae potential. The 
NG fields that block-diagonalize the bilinear part of ga as in (6.23) constitute a 
set of (local) Darboux coordinates. The existence of a symplectic structure on G/H 
underlines that it should be treated as the phase space of the EFT. The EoM is of 
first order in time derivatives. The number of independent modes in the spectrum 
equals (1/2) dim G/H. 

What if the spectrum includes both type-A and type-B NG bosons? Then the 
2-form dc(z) becomes singular, yet it still defines a new structure on G/H, 
called presymplectic. Intuitively, the coset space becomes partially a phase space 
and partially a configuration space. The paired NG fields giving rise to type-B 
NG bosons are “phase space coordinates.” These are augmented with additional 
“configuration space coordinates,” corresponding to the type-A NG fields. The 
dynamics of the former and the latter is respectively of first and second order in 
time. 

Let us try to be a bit more precise while remaining physically intuitive. Suppose 
that the commutator matrix (8.26) were the sole order parameter our system 
possesses. This would break the symmetry group G to some subgroup K. Suppose 
also that G is compact and that its Lie algebra g does not have any nontrivial central 
charges (for instance because G is semisimple). According to the discussion in 
Sect. 6.3.2, our order parameter then corresponds to the VEVs of a set of conserved 
charges that belong to a Cartan subalgebra of g. These charges together generate an 
Abelian subgroup T C G, usually called a torus. In group theory terminology, the 
subgroup K = {g € G| gh = hg Wh € T} is the centralizer of the torus T in G. 

The NG modes owing their existence to the order parameter og, should be 
described by an EFT that lives on the coset space G/K. A coset space G/K where 
K is the centralizer of a torus in G is called a flag manifold; see Chap. 7 of [14] for 
an introduction. Flag manifolds are known to carry a natural symplectic structure, 
corresponding to (8.23) with our order parameter oap. In physics terms, the spectrum 
of the EFT on G/K contains by construction only type-B NG modes. Their number 
is (1/2) dimG/K. 


Example 8.11 


Consider G ~ U(n) and the torus T generated by the diagonal matrices 


(Ate 0 ) (8 85) 
0 [A2Len-m)xn-m) : 


where 1,2 € R. The centralizer of this torus in G consists of all block-diagonal 
unitary matrices, K ~ U(m) x U(n—m). The flag manifold U(n) /[U@m) x Um—m)] 
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is called the (complex) Grassmannian. The special case of m = | is the complex 
projective space, CP” ~ U(n + 1)/[U(™) x U(1)]. 


How do we add the type-A NG bosons then? In presence of additional order 
parameters, the symmetry will be broken further down from K to H. This will lead 
to additional dim K/H modes, which we can identify with the type-A NG bosons. 
All this of course does not mean that we will have two different EFTs, one living 
on K/H for the type-A NG bosons, and the other living on G/K for the type-B 
NG bosons. The two coset spaces are geometrically integrated into G/H through a 
fiber bundle structure, K/H —> G/H 4 G/K. Ignoring the type-A NG degrees 
of freedom amounts to the projection x : G/H — G/K from the total space G/H 
to the base space G/K. The type-A NG fields span the fiber K/H above each point 
of the base space. The presymplectic structure on G/H is obtained by pulling back 
the symplectic 2-form on the base space G/K via the projection map zr. 


Example 8.12 


Consider a theory with a G ~ U(n) symmetry. Suppose that the symmetry is broken 
down to H ~ U(n — 1) by the expectation value of a complex scalar field ® that 
transforms in the fundamental representation of G. Since the action of G preserves 
the norm of ®, the coset space U(n)/U(n — 1) is obviously equivalent to $2”—!. For 
convenience, we can choose the order parameter as (®) = (1,0,..., 0). The Lie 
algebra g of G includes two linearly independent singlets of H, Qı = 1 and Q2 = 
diag(—n + 1,1,..., 1). Their linear combination, Qj = [Q2 + (n — 1)Q)]/n = 
diag(0, 1,..., 1), generates the sole U(1) factor of H. The generator Q; is a singlet 
of the whole group G and its VEV therefore does not break the symmetry. On the 
other hand, the VEV of Q2 constitutes a candidate order parameter responsible for 
type-B NG bosons in the spectrum. See the closely related Example 6.6. 

If (Q2) = 0, or (Qq) = (n — 1)/n(Q1), we have no order parameter, hence 
K ~ G. In this case, the coset space G/H is pure type-A; there are 2n—1 type-A NG 
modes. If, however, (Q2) 4 0, there are n — 1 type-B NG modes that parameterize 
the coset space G/K ~ U(n)/[U() x U(n — 1)] ~ CP”7!. Above each point of 
this base space, there is a fiber K/H ~ U(1), carrying one type-A NG degree of 
freedom. 


5 Since this is the only place in the book where the concept of a fiber bundle appears, I have decided 
not to include a detailed explanation in Appendix A. I hope that even a reader without the necessary 
mathematical background can extract some useful information from the discussion. 
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The material of Chaps. 7 and 8 is in principle sufficient to construct the low-energy 
effective field theory (EFT) for any system with a spontaneously broken internal 
symmetry. However, the implementation of the general methodology for a concrete 
physical system often still requires a nontrivial amount of effort. Moreover, the 
actual phenomenology of the EFT may depend sensitively on the internal and 
spacetime symmetries present. In this chapter, I will therefore work out in detail two 
applications of the general formalism, one in particle physics and one in condensed- 
matter physics. Apart from serving as an extensive illustration, this will allow us 
to dive deeper into some technical details of EFTs for Nambu—Goldstone (NG) 
bosons: the consistency of the derivative expansion of the effective Lagrangian, and 
the topological aspects of quasi-invariant Lagrangians. 


9.1 Chiral Perturbation Theory of Mesons 


Historically, the development of EFT for NG bosons was largely motivated by the 
need for a phenomenological description of low-energy hadron physics.! On the one 
hand, computation of hadron properties from first principles is challenging because 
the strong nuclear interaction is not amenable to a perturbative treatment at energies 
below the intrinsic scale of quantum chromodynamics (QCD). On the other hand, 
the spectrum of lightest hadrons exhibits scale separation; see Fig. 9.1. The eight 
lightest particles in the spectrum—the three pions, four kaons and the 7-meson— 
are all pseudoscalars with varying quark flavor composition. Their masses are much 
lower that what one would expect given the constituent mass of about 300 MeV for 
the u,d quarks and nearly 500 MeV for the s quark. This suggests that the low- 
energy physics of the light pseudoscalar mesons should be captured by an EFT. 


' This is also reflected by the widespread terminology. In the high-energy physics literature, it is 
still quite common to refer to NG bosons collectively as “pions.” 
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Fig. 9.1 Mass spectrum of n (3493 
light hadrons. All the masses p (938 

in parentheses are shown in 

MeV. Current numerical w (783) 
values as of writing this book = +p (775) 


can be found in the Review of 
Particle Physics [1] 


n (548) 
K?,K? (498) 
KË (494) 


n* (140) 
n? (135) 


m QCD vacuum 


The key to understanding the observed scale separation in the hadron spectrum 
is provided by spontaneous symmetry breaking (SSB). The gauge interaction of 
QCD is vector-like, that is, gluons couple equally to left- and right-handed quarks. 
Moreover, the interaction is insensitive to the quark flavor. As a consequence, in 
the limit of vanishing quark masses, QCD with nf quark flavors possesses a G ~ 
SU(n¢)L x SU(nf)r x U(l)g symmetry. Under an element (gL, gr, e£) € G, the 
left- and right-handed quark spinors WL, g transform as 

YPL > 3 of YL ; PR > e/F oR WR : (9.1) 
The unitary matrices gy p act on the flavor index of the quark field. The vector-like 
U(1)p group corresponds to conservation of baryon number, and the factors 1/3 
in (9.1) indicate the baryon number of a single quark. In addition to this internal 
chiral symmetry, QCD is invariant under the spacetime Poincaré group, and under 
the discrete symmetries of charge conjugation, parity and time reversal. 

In the ground state of (still massless) QCD, the chiral symmetry is spontaneously 
broken down to the “vector” subgroup, H ~ SU(nf)y x U(l)g, consisting of 
elements of the type (g, g, e€), that is, g = gr. This implies the existence of ne -1 
pseudoscalar NG bosons. Restricting to nf = 2 accounts for the lightest mesons: the 
pion triplet. The additional five modes that appear with nf = 3 are the strange 
pseudoscalar mesons (kaons) and the 7-meson. Of course, in reality, none of these 
is exactly massless. They are all pseudo-NG bosons owing to the fact that the chiral 
symmetry is only approximate, being explicitly broken by current quark masses. 
The fact that the strange mesons are heavier than the pions is a consequence of the 
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s quark being considerably heavier than the u and d quarks. The masses of the c, t 
and b quarks are so high (above the intrinsic scale of QCD) that with ng > 4, QCD 
does not possess even an approximate chiral symmetry. In practice, it is therefore 
sufficient to focus on the nf = 2, 3 cases. 


The classical Lagrangian of massless QCD is also invariant under the 
axial symmetry U(1)a, which amounts to YL R — eH g. Being also 
spontaneously broken in the QCD vacuum, this would suggest the existence 
of another, flavor-singlet, pseudoscalar meson. Yet the lightest available 
candidate is the 7’-meson with the mass of 958 MeV. This is too high to 
be accounted for by explicit symmetry breaking due to quark masses. The 
resolution of this so-called U(1), problem is that the axial symmetry is broken 
at the quantum level by the axial anomaly, arising from nonperturbative gluon 
dynamics. See Sect. 13.6 of [2] for further details. 


The EFT for the light pseudoscalar mesons, constructed below, is called chiral 
perturbation theory (ChPT).” The reader will find a modern graduate-level expo- 
sition of ChPT in the dedicated monograph [3]. However, the pioneering works of 
Gasser and Leutwyler [4,5] remain a valuable source of insight. The construction of 
ChPT will follow the general machinery developed in Sect. 8.2 with the appropriate 
coset space, G/H ~ [SU(ns)_L x SU (nf)r]/SU(nf)y. Here I have tacitly dropped the 
U(1)p factor of the symmetry group of QCD. This remains unbroken and moreover 
leaves the meson fields intact, and so has no effect on the invariant part of the ChPT 
Lagrangian. I will reinstate the U(1)g symmetry in Sect. 9.1.4, where it will help us 
build the quasi-invariant (anomalous) part of the Lagrangian. 


9.1.1 Power Counting 


In a Lorentz-invariant EFT, the effective Lagrangian is organized by the total 
number, n = s + t, of spatial and temporal derivatives, 


awl = > 4 le (9.2) 


n>2 


Discounting possible tadpole operators, linear in 7“, a minimum of two derivatives 
is enforced by the nonlinearly realized internal symmetry and Lorentz invariance. 
The latter is ensured by contracting Lorentz indices of spacetime derivatives with 


2 I will adhere to this established name but stress that it is a misnomer. The derivative expansion of 
ChPT, worked out below, has nothing do to with ordinary perturbation theory known from quantum 
field theory. 
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the Minkowski metric g,,, or the Levi-Civita (LC) tensor, €,,y.... Hence, in an even 
number of spacetime dimensions D (which is the case of QCD, where D = 4), only 
operators with even n can contribute to (9.2). 

In order to assess the relative importance of the individual parts 2 consider 
a generic Feynman diagram I, contributing to a given observable. Suppose the 
diagram contains altogether J internal propagators, L loops and V, interaction 
vertices from each of ace Upon Fourier transform, the diagram will evaluate to a 
homogeneous function of energy—momenta on the external legs. Adding up all the 
powers of energy-momentum from the vertices, propagators and loop integrals, the 
degree of the diagram as a function of the external energy—momenta then reads 


deg? = DL = 21 +Y nV. (9.3) 


n22 


The variable J can be eliminated using the identity J = L+}, Vn — 1, known from 
graph theory to hold for any connected graph (see, for instance, Sect. 13.4 of [6]). 
This leads to 


deg? = 2+ (D = 2)L +$ (n — 2)Vn . (9.4) 


n>2 


Diagrammatic contributions to a given observable can now be organized by 
increasing powers of energy-momentum; higher powers imply stronger suppression 
at low energies. Any Feynman diagram will have degr > 2. The leading-order 
(LO) contribution to the observable corresponds to deg I’ = 2 and consists of all 
tree-level (L = 0) diagrams with all vertices from ar . This justifies a posteriori 
the focus of this book on classical Lagrangians with the lowest possible number of 
derivatives. 

The degree deg I’ can be increased by adding vertices from higher-order (n > 3) 
parts of the Lagrangian or by adding loops. In D = 4 dimensions, the next-to- 
leading order (NLO) corresponds to deg I = 4. This can be reached in two different 
ways. Either we restrict ourselves to tree-level diagrams but allow for one vertex 
from 29 or allow one loop but keep all vertices from A It is easy to extend 
this reasoning to classify possible contributions to any observable at even higher 
orders of the derivative expansion. 

Let me conclude the discussion of power counting with several remarks. First, 
for pseudo-NG bosons, the propagator i/(p? — m°) is actually not a homogeneous 
function of the energy-momentum p” because of the mass m. That can be fixed 
by assigning the mass a formal counting degree 1 so that m? counts equally to 
p°. This affects the classification of operators in the effective Lagrangian of ChPT 
that incorporate the effects of explicit breaking of chiral symmetry by the quark 
mass, mg. We expect based on Sect. 6.4.1 that m will scale as „Mq. Hence, we 
can continue using (9.4), as long as we count the quark mass as a small quantity of 
degree deg mq = 2. Similarly, we will want to couple ChPT to a set of background 
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gauge fields, A,,. In order for the power counting to be consistent with background 
gauge invariance, we must assign these fields the degree deg A,, = 1. 

Second, the counting rule (9.4) also tells us how to renormalize the EFT. In order 
to remove an overall Sale in a Feynman graph I’, we need a counterterm 
among the operators in a D. Since all diagrams at deg = 2 are tree- 
level, the effective uoe. in Z D are finite ola that do not run with the 
renormalization scale. The effective couplings in Ze ) must include divergences 
that cancel against loop diagrams with degree n. Their running will be determined 
by the corresponding renormalization group equation. Important is that to any finite 
order in the derivative expansion, only a finite number of operators, and a finite 
number of counterterms, is needed. 

Third, note that for D = 2, all loop diagrams with the same V,,>4 and arbitrary 
Vz contribute at the same order of the derivative expansion. The lack of suppression 
of loop effects hints that the EFT is no longer weakly coupled. Eventually, it turns 
out that the infrared fluctuations of NG bosons are so wild in D = 2 dimensions that 
they destroy the order parameter leading to SSB. I will return to this in Sect. 15.2. 

Finally, the basic counting rule (9.4) and much of the above comments applies 
equally to any EFT with only type-A NG bosons, relativistic or not. The LO 
Lagrangian will still be a, The LO contribution to any observable will come from 
tree-level diagrams with all vertices from -2 a The classification of contributions 
at higher orders may however be modified by the presence of operators with odd 
n > 3 regardless of the spacetime dimension D. The details for any particular choice 
of effective Lagrangian and D are easy to work out using (9.4). 


9.1.2 Effective Lagrangian 


The coset space of QCD, G/H ~ [SU (nf)L x SU (nf)r]/SU(nf)y, is symmetric. Let 
me therefore start by recalling some general properties of symmetric coset spaces; 
see Sect. 7.3.2 for full detail. 

By definition, a coset space is symmetric if the Lie algebra g of G possesses an 
automorphism & that acts as identity on the unbroken subalgebra } and as minus 
identity on the complementary subspace g/b of g. Loosely speaking, R changes the 
sign of all broken generators of G while leaving all unbroken generators intact. The 
automorphism R can be, at least locally, lifted from the Lie algebra g to the Lie group 
G. One can then choose the coset representative U (zr) so that R(U (1)) = U (m)! 
With this choice, one can build a matrix-valued NG field that transforms linearly 
under the entire group G, 


Em) =Um, Em) S Ea, g) =8g8E RE . (9.5) 
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The automorphism R makes it easy to project out the broken component of the 
gauged Maurer—Cartan (MC) form (8.47), 


1 
Qı (x, A) = zl, A) — R(Q (x, A))] , (9.6) 


where A = A, dx" is the g-valued 1-form gauge field of G. Upon a brief 
manipulation using the definition of R, we find that 


Qı (m, A) = -EUT DECC = EUDE UC) , (9.7) 
where 
DX (mr) =dE (x) —iAE (r) +iEm)R(A) (9.8) 


is the G-covariant derivative of X(x), and DE (x)! is defined analogously. 

What does all this translate to in case of ChPT? Here R acts by swapping the 
transformations of left- and right-handed quarks, R(gL, gr) = (gr, gL). It is then 
natural to choose the coset representative as 


U(r) = (u(t), u(t) ') where u(r) € SU(nẹ) « (9.9) 


The matrix variable E(x) = (u(x), ue 2) is subject to the linear transformation 
Er) S (gL, 8&R)E CT) (ER, gL) !. All information about the NG fields can 
then be encoded in a single SU(n¢)-valued matrix variable, 


Un) = ule), Um) S28 UC, gL, gx) = gL U)gg!. (010) 


The Lie algebra of SU (nf)L x SU(nf)r is su(nf)L ®su(np)r. In our pair notation, the 
matrix-valued gauge field A,, can thus be decomposed as Ay, = = (AL, 1) + (1, AR), 
where ALR are iependien su(nf)-valued gauge fields acting respectively on 
the left- and right-handed quarks. A straightforward manipulation then leads to 
DEn) = (DU), U!) + (Ut), DU) t), where 


D, U) = pU) — AGU) + iU(r) AR , 


(9.11) 
D Um)! = pU! — IARU)! HUAT . 
This eventually gives 
Qı = -5 [0T Du, 1) + (1, (DUH u)] . (9.12) 


Before we can write down even the LO effective Lagrangian of ChPT, we still 
have to discuss the explicit breaking of chiral symmetry by the quark masses. 
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The mass term in the microscopic Lagrangian of QCD takes the form YL MYR + 
URM WL, where M = diag(m,, ma, ...) is a real diagonal matrix, collecting the 
quark masses. In the spirit of Sect. 8.2.3, this is promoted to a complex matrix field 
that transforms under the chiral symmetry as M ——> gLM8gR l thus restoring 
exact chiral invariance of the QCD Lagrangian. The basic building block for 
incorporating the effects of explicit symmetry breaking in ChPT is then the matrix 
field 


a(r, M) = u(t)! Mur)! . (9.13) 


Being linear in quark masses, this is assigned the order deg & = 2. 

The operators we now have for building the LO effective Lagrangian of ChPT 
are, schematically, Q 1 ,, OF and &. The only way to ensure invariance of these 
operators under the linearly realized unbroken subgroup, SU(¢)v, is to take a trace. 
Moreover, E has to enter the Lagrangian through tr(E + E7). This is required 
by parity invariance of QCD and the fact that parity interchanges left- and right- 
handed quarks, thereby acting on both u(z) and M by Hermitian conjugation. At 
the end of the day, there are only two independent operators one can put into the LO 
Lagrangian, 


fzB i i 
[MU m+ M Um)]. (9.14 


2 
LO = fn tr [D Um) DYUr)] + 


Accordingly, there are two independent parameters, conventionally denoted as fr 
and B, both with mass dimension 1. 


Example 9.1 


Let us work out some immediate consequences of the LO Lagrangian (9.14). To 
that end, I will drop the background gauge fields and parameterize U(r) in terms 
of a Hermitian traceless matrix T(m) as U(x) = expli (a )/fz]. This is a matrix 
version of the familiar exponential parameterization of the coset space; the factor 
fr is inserted to give II mass dimension 1. Upon expansion in powers of IT, the 
Lagrangian becomes 


gd JZB 


eff =y tr(M + MÝ) 


tiram] Ë [M+ Mn] +O), 
(9.15) 


where I used that M is ultimately Hermitian to drop terms linear and cubic in TI (7). 

The first, constant term contributes to the energy density of the chiral-symmetry- 
breaking vacuum. Taking the derivative of the vacuum energy density with respect 
to any of the quark masses in turn gives the vacuum expectation value (VEV) of 
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the corresponding mass operator; cf. (5.16). The latter serves as the order parameter 
for spontaneous breaking of chiral symmetry. The LO prediction of ChPT, based 
on (9.15), is that this so-called chiral condensate is independent of the quark flavor. 
The total condensate, summed over all quark flavors, equals 


(WW) = -nt fB . (9.16) 


Let us now turn attention to the bilinear part of (9.15). It is convenient to further 
parameterize the matrix TI (7r) in terms of ne — 1 meson fields x° as II (m) = m° Àa. 
Here i, is a basis of traceless Hermitian n¢ x nf matrices, normalized as tr(AgA,) = 
25ap. This ensures canonical normalization of the kinetic term for the meson fields. 
In case of nf = 2, the Ags are just the usual Pauli matrices Ta, whereas for nf = 3, 
the so-called Gell-Mann matrices can be used instead. The next step is to find the 
eigenvalues of the mass matrix for x°. Setting nf = 3 and using the known flavor 
composition of the light pseudoscalar mesons, these eigenvalues can be identified 
with the masses of the individual states in Fig. 9.1 as 


2 2B f 2 2 
7 = 3 My + Mq + Ms — | Mmg + my + Mg —MmymMg—MyMs — MqMs | , 


m 


m2. = B(m, + ma) , 

mys = B(mu + ms) , (9.17) 

mio = B(ma + ms) , 
> 2B 


m; 3 (m +mqa + ms + m2 + mŽ + m? My Mq — MyMs mams) ; 


This determines the five different meson masses in terms of the three current quark 
masses, yet the latter are not uniquely fixed by (9.17). Indeed, any overall rescaling 
of the quark masses can be absorbed into a redefinition of the B parameter. It is 
however possible to use (9.17) to eliminate B and the quark masses altogether and 
thus obtain a constraint on the meson spectrum, 


2(m2 


T 


+ mge + mio) = 3(m29 + m?) . (9.18) 


This so-called Gell-Mann—Okubo formula can be interpreted, for instance, as giving 
an estimate for the mass of the 7-meson in terms of those of the pions and kaons. 
With the data shown in Fig.9.1 as input, one gets my ~*~ 568 MeV, which is less 
than four per cent off the correct value. 


The machinery developed in Sect. 8.2 can in principle be applied to an arbitrarily 
high order of the derivative expansion of ChPT. However, already at NLO (n = 4), 
constructing the effective Lagrangian is a nontrivial exercise. I will therefore content 
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myself with spelling out the final result and refer the reader to Sect. 3.2 of [7], where 
all details are worked out rather pedantically. For both nf = 2 and nf = 3, the 
invariant part of the ChPT Lagrangian at NLO can be written as 


LO = cy (D UDU D,UD?U) + cp (D, UDU) tr(DyUD?U") 
+ c3 t(D UDU) t(D UDU) 
+ c4te(Fy, DUD’ UÝ + FR DY UÝ DU) 
+ cs (FUF UT) + col F FH + FR F) (9.19) 
+ di (MUP) (MU) + da[ (t MU")? + (tr MIM)? ] 
+ d3 (MUMU + MUMU) + da tr(MMŻ) 
+ ds t(MUŻ + MİU) t(D, UDU) 
+ de tr [(MU} + UM") D,UDEU"}] . 


Here F i and F a are the field-strength tensors of the background gauge fields. The 
12 effective couplings cj_¢ and d\_¢ are mutually independent in the nf = 3 case. In 
case of nf = 2, special properties of 2 x 2 matrices make the cı operator redundant 


with c2, and the dg operator redundant with ds. 


The form of the NLO Lagrangian shown in (9.19) matches the detailed 
derivation offered in [7]. However, in the literature, a somewhat different basis 
of operators is often used. For the reader’s convenience, I list here detailed 
relations between the parameters c1-6, dj-6 introduced above and the more 
common NLO couplings of ChPT L110, H1-2, cf. Sect. 3.5.1 of [3], 


c = L3, 0 = Li, c3 = L2, c4 = —iLo , 
c5 = Lio, c6 = M, dı = 2(L6 — L7) , d = Le + L7, 
dz = Lg , d4 = M, ds = L4, de = L5. 


(9.20) 


9.1.3 Interaction with External Fields 


For further illustrations of the use of ChPT, I will utilize its simplest version: the 
nf = 2 ChPT in the “isospin-symmetric” limit. In this limit, one sets m, = mg = m 
so that all three pions have the same squared mass, mŽ = 2Bm. Accordingly, the 
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LO Lagrangian (9.14) reduces to 


2 
Log = f Tuo ut D"an + m2 (U+ U’)]. (9.21) 
The kinetic term can be further expanded and organized by powers of the back- 


ground gauge fields, 
t(D U DEU) = tr [Ip U U — iA; (UI U — I UU") (9.22) 
—iARCU a" U — UU) 
— 2UARU' AM + AT A + ARA]. 


Knowing the explicit dependence on the background fields is, among others, a useful 
starting point for deriving the Noether currents of chiral symmetry. I will however 
focus on illustrating the implications of some particular choices of actual, physical 
background fields. 


As explained in Example 8.7, a chemical potential parameterizing the statistical 
equilibrium of a many-body system can be introduced in the Lagrangian as a 
constant temporal gauge field. Thus, the effects of nonzero density of isospin, that 
is the diagonal generator of SU(2)y, can be captured by setting 


T3 
AL = AÈ = 5,011 , (9.23) 


where ur is the isospin chemical potential. The actual statistical ground state is 
found by minimizing the Hamiltonian of ChPT with respect to U. A detailed 
analysis shows that the ground state can be represented by a real orthogonal matrix, 


(U) = ( cosg a l (9.24) 


— sina cosa 


For |u| < mz, the value of the angle œ minimizing energy is a = 0, implying 
(U) = 1. This is the usual QCD vacuum. On the other hand, for |u| > mx, the 
ground state corresponds to cosa = m2 / bie This state describes Bose-Einstein 
condensation of charged pions. The condensate carries nonzero isospin density 
obtained as minus the derivative of the Hamiltonian density with respect to ur, 


. m$ 
nt = fèu sin? a = fèu ( = ae . (9.25) 
I 
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Next, let us look at the spectrum of excitations above the ground state (9.24). 
This can be extracted by expanding the Lagrangian (9.21) to second order in 
fluctuations around (9.24). In the vacuum phase (|u| < mz), the neutral pion 
maintains its relativistic dispersion relation, E,0(p) = y p? + m2. The energy of 
the charged pions is, on the other hand, trivially shifted by the chemical potential, 


E,+(p) = y p° + m2 + uy. In the pion condensation phase (|u1| > mx), the 
dispersion of the neutral pion changes to 


E,,0(p) = ¥ p? + 2 . (9.26) 


This is a relativistic-looking dispersion, except that the “mass” equals |ur|. That is 
not a coincidence. In the pion condensation phase, the neutral pion mode can be 
interpreted as a massive NG boson of the isospin SU(2)y symmetry; see Sect. 6.4.3 
and [8] for details. In the charged pion sector, it is no longer possible to distinguish 
isospin (or electric charge) eigenstates as a result of SSB. There are two excitation 
branches that are mixtures of z* and x7, and their squared energies are 


2 
Ex(p) = p + Aa + 3 cos? a) + “ja +3 cos? æ)? u? + 16p? cos? a . 
(9.27) 


The lower of the two branches is gapless, E_ (0) = 0. This is the NG boson of the 
spontaneously broken isospin symmetry. Further discussion of meson condensates 
in QCD can be found for instance in the pedagogical review [9]. 


QCD alone does not encompass all of particle physics. Hadrons can also interact 
via the weak and electromagnetic forces. ChPT makes it easy to couple pseudoscalar 
mesons to the electroweak sector of the Standard Model (see, for instance, Sect. 20.2 
of [10] for an overview). Indeed, we can imitate the coupling of quarks to the 
electroweak gauge bosons by setting 


L_ E g' R 
Ai = qt Aut gPa f A = g QB, ; (9.28) 
Here A,, is a triplet of potentials of the weak isospin gauge group, SU(2)r. Similarly, 
B,, is the potential of the hypercharge gauge group, U(1)y. The corresponding 
gauge couplings are g, g’. Finally, Q is the matrix of electric charges of the u and d 
quarks, 


Eo © 54, 28 
TE Pa) = git ge: (9.29) 
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A complete set of LO electroweak interactions of pions is then obtained by inserting 
the above definitions into (9.22). 


A glance at (9.22) shows that adding the electroweak gauge sector leads to nontrivial 
effects even in the ground state of ChPT, (U) = 1. Namely, the last three terms 
in (9.22) generate a mass term for the electroweak gauge bosons, 


fa 


2 
L aL RAR LAR Í L Ry; aL R 
eae +AA — 2AA") = g [M — Awa n — A*+)] 


(9.30) 


2 
a ee Al)? + (8427 


+ (gAj, — 8’ By)’ | 
_ fi z + u 12 u 
z Pewi WH+(g*+2 Zp2" | : 
Here, I introduced the charged gauge boson fields by Wie = (Ai, + +iA2 2)/v2, 
and the neutral weak gauge boson via Z,, = = A} cos Ow — B, Sipa. Finally the 
Weinberg angle Ow is related to the gauge couples by ee = g/Vg7 + 8°. 
Thus, spontaneous breaking of chiral symmetry in QCD leads to the following 
contributions to the masses of the electroweak gauge bosons, 


1 1 
=f > mz = The +g’). (9.31) 


Given the characteristic scale of QCD, encoded in the value of fr (fixed precisely 
below), these contributions are tiny. However, the idea that the gauge boson masses 
might be generated by a strong dynamics that spontaneously breaks chiral symmetry 
is intriguing. It lies behind the “technicolor” scenario of dynamical electroweak 
symmetry breaking. In this scenario, the Higgs boson is not elementary, but rather 
a composite bound state of constituent “techniquarks.” The value of fr is expected 
to be of the order of the electroweak scale, that is a few hundreds of GeV. See [11] 
for a pedagogical introduction to technicolor models. 


The physical value of f; can be fixed by likewise utilizing the coupling of pions 
to the electroweak sector of the Standard Model. All we need is a single observable 
that does not depend on any other as yet unknown parameter. A suitable candidate 
is the leptonic decay of the charged pion. 
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Fig. 9.2 Feynman diagram for the leading (tree-level) contribution to charged pion decay. The 
x *-W* coupling is provided by ChPT whereas the interaction vertex between W+ and the 
charged lepton current follows from the Standard Model of electroweak interactions 


The charged pion * decays with the probability of 99.988% into an antimuon pt 
and a muon neutrino v. The leading perturbative contribution to the amplitude 
for this decay is shown in Fig.9.2. The conversion of an on-shell pion into a 
virtual W-boson is described by our Lagrangian (9.21). Indeed, by expanding it 
to the first order in both WF and 2* = (r! + in?) /V2, we find the bilinear 
term —(fzg/2)(W*" 3 r + W “d,27*). The subsequent decay of the virtual W- 
boson into a lepton pair is governed by the “charged-current” interaction of the 
Standard Model, specifically the operator (¢/V/2)(W,t iy" but + Wy, Duty" uL). 
I used an obvious notation for the spinor fields representing the leptons. Putting 
all the pieces together, the invariant amplitude for the 7+ — yt + v, process 
becomes 


Agt+pt+y, = ifrGru(p)k( — ys)v(q) . (9.32) 


Here k”, p”, q” are respectively the four-momenta of the pion, muon neutrino and 
antimuon. Also, u(p) and v(q) are the corresponding Dirac spinors; polarization 
indices are suppressed for clarity. Finally, Gp is the Fermi coupling constant, 


2 


E -5 =2 
Gr = ——— 7 1.166 x 10 °GeV ~. 9.33 
Bean oe) 


Upon squaring the amplitude and summing over polarizations of the particles in the 
final state, the integrated decay rate in the rest frame of the pion is found to be 


fe Gp mp 0ng ~My) 
Pats +, = 4 


= z (9.34) 


Fig 
I have treated the neutrino as a massless particle. The masses of the pion and 
antimuon are, respectively, mz ~ 139.6 MeV and m, ~% 105.7 MeV. Finally, we 
need an input on the lifetime of the charged pion, t ~ 2.60 x 1078 s. This converts 
to the total decay rate of T ~ 2.53 x 1078 eV. At the end of the day, we get an 
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estimate for the pion decay constant, 
fr © 91 MeV . (9.35) 


The result (9.34) tells us more than merely a good estimate for the total decay rate 
of the charged pion. Namely, leptons from different families have identical weak 
interactions. Upon replacing m, with the electron mass me, (9.34) therefore also 
gives us a decay rate for the process 7+ — et + ve. This is more conveniently 
expressed in terms of the branching ratio, 


Patset+y, om (m= — m 


Rasan © =E r e n128 x104. 9.36 
MEETA NE Portsptty, mî, (m2 — m?) i 


This is very close to the experimental value, which is about 1.23 x 1074 [1]. The 
suppression of the electron decay channel compared to the muon one is purely 
kinematical. By angular momentum conservation, one of the leptons in the final 
state must be left-handed and one right-handed. Yet, the W-boson only couples to 
left-handed fermion fields. The combination of these two effects requires a helicity 
flip and is responsible for the proportionality of (9.34) to the lepton mass squared. 


9.1.4 Effects of the Chiral Anomaly 


So far I have tacitly assumed, following Sect.8.2, that the effective action of 
ChPT is gauge-invariant in presence of the background fields. This allowed us to 
construct strictly gauge-invariant Lagrangians at LO (9.14) and NLO (9.19) of the 
derivative expansion of ChPT. Are there any contributions to the ChPT Lagrangian 
that are merely quasi-invariant? A detailed derivation of such contributions and 
their coupling to background gauge fields would require a differential-geometric 
approach akin to that of Sect.8.1. The problem of finding all such Wess—Zumino 
(WZ), or Wess—Zumino—Witten, terms was studied thoroughly in the 1980s and 
1990s. For a discussion close in spirit to this book, I refer the reader to [12,13]. A 
pedagogical account of the method including explicit expressions for quasi-invariant 
Lagrangians for a broad class of coset spaces can be found in [14]. 

Here I will resort to a trick, which gives the right answer in case of nf = 2 quark 
flavors. Suppose we were able to construct a current J“, conserved off-shell, that 
is without imposing the equation of motion (EoM) for the NG fields in our EFT. If 
in addition the EFT includes an Abelian gauge field A,,, then the operator A, J” is 
quasi-invariant and can be added to the Lagrangian density. It remains to guess what 
J” and A, might be within ChPT. The current is the tricky bit. For the moment, I 
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will simply write it down, a partial a posteriori justification will be offered below; 


IEn = JeltoB tr [ununun 
3i (9.37) 
1 
— FS OvUU) Fag — tiuri] 


This is the Goldstone—Wilczek (GW) current; the overall factor A is in principle 
arbitrary. In the absence of the background fields, the GW current would be 
manifestly conserved thanks to the antisymmetry of the LC tensor and cyclicity 
of trace. With the background in place, it is a nontrivial but straightforward exercise 
to show that? 


tide = sh ett tr (— FE, Fog + FR Fs) - (9.38) 
The current is not conserved as promised, except for backgrounds that are purely 
vector-like, AL = AR. Luckily, this is not a problem. In fact, the nonconservation 
of the GW current turns out to be exactly what is needed to implement correctly the 
microscopic physics of QCD within ChPT. 

To that end, recall that the flavor symmetry of QCD has a single U(1) factor, 
namely the baryon number U(1)g. This can also be coupled to a background gauge 
field, AB, even if such a field may not have an obvious experimental realization. 
The presence of a coupling AB Jey in the ChPT action implies that it is possible to 
create baryon number solely out of meson fields. This intriguing possibility was first 
proposed by Skyrme in the 1960s. In presence of the chiral background fields ALR, 
baryon number is not conserved due to the chiral anomaly. An explicit calculation 
(see Sect. 22.3 of [15]) shows that this anomaly is reproduced at the ChPT level 
by (9.38) if we set A = —1/(247). This brings us to the final expression for the 
WZ term in the ChPT Lagrangian for ns = 2 quark flavors, 


1 cF a 
= -o ALN | (UD, UNUD UUD U) 


7 
- F [Dun Fi - tiuri] 
(9.39) 


The superscript ® indicates that the WZ term enters at the NLO of the derivative 
expansion of ChPT. Note also that it does not come with an arbitrary coupling. The 


3 A reader willing to check this should note that the GW current is manifestly gauge-invariant. It is 
thus sufficient, and advantageous, to compute its gauge-covariant derivative. This maintains gauge 
invariance at every step and simplifies the calculation thanks to the Bianchi identity for F ae 
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normalization of the WZ term is fixed by anomaly matching and does not receive 
any radiative corrections. 


The WZ term (9.39) is manifestly invariant under background gauge trans- 
formations from the SU(2)L x SU(2)p chiral group. On the contrary, under a 
background U(1)g transformation, AB => AB + d,€8, the corresponding WZ 
action changes by 


8Swz = — i dive O)ouJcg E), (9.40) 


with the divergence of ee given by (9.38). Watch the interplay of two 
symmetries: the variation of the action under U(1)g is proportional to the 
background fields for SU(2), x SU(2)r. What we have here is an example of 
a mixed ’t Hooft anomaly. This should be contrasted to the naive U(1), axial 
symmetry of QCD. The divergence of the axial current receives a contribution 
from the dynamical gluon fields, giving an example of an Adler—Bell— 
Jackiw anomaly. This kind of anomaly fundamentally invalidates a would-be 
classical symmetry of a quantum system. On the other hand, a symmetry 
exhibiting a °t Hooft anomaly still implies exact relations (Ward identities) 
for the generating functional of the theory. This makes ’t Hooft anomalies a 
powerful tool for constraining low-energy EFTs, as I have illustrated here. 


The construction of the WZ term (9.39) is not a mere academic exercise, as one 
might suspect from the presence of the “baryon number gauge field.” The term has 
measurable consequences for the electromagnetic interactions of pions. To see why, 
recall (9.29), which shows that electric charge does not belong to the chiral Lie 
algebra su(2)L x su(2)p due to not being traceless. Interactions of pions with an 
external electromagnetic field (with all other background fields switched off) can 
then be generated by setting 


L_ ysR_&_ 4@ B_ © 40 
AR=AR= SAR, AR = oAg. (9.41) 


Here Ag is the electromagnetic gauge potential and e the electromagnetic coupling. 
The effects of interaction with the electromagnetic field via the WZ term are most 
striking in case of the neutral pion. 


Example 9.5 


Let us keep only the electromagnetic background field and the neutral pion 7°. The 
charged pions are discarded by using the replacement U —> exp(ir°t3/f,). Upon 
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simple integration by parts, the whole WZ term (9.39) then boils down to 


rle pO po (9.42) 


(4) 
Lyz > ~ 32x? fy uva’ 


This operator governs the electromagnetic decay of 2° into a pair of photons. 
Denoting the four-momentum of the pion as k” and those of the photons as p”, q”, 
the invariant amplitude for the decay turns out to be 


2 
e 
Anos yty = An? fa eh? Dye; (P)qu€p(q) - (9.43) 


Here é (p) and e (q) are the polarization vectors of the photons in the final state. 
It remains to take the square and sum over polarizations of the photons. The final 
result for the decay rate in the rest frame of the pion is 


r _ omy 9.44 

W9>y+y = 64mr3 f2 ’ (9.44) 
where a = e? /(47) is the fine structure constant. Using the numerical input mz ~ 
135.0 MeV and a ~ 7.297 x 107° along with the value for fy found in Example 9.4, 
our final result is T0 = 8.0eV. This is less than 3% off the experimental 


n’—>y+y 
value of 7.81 eV [1]. 


The two-flavor WZ term (9.39) has remarkable physical consequences, yet 
vanishes by construction in the absence of external fields. A new twist in the story 
comes for nf = 3 quark flavors. Here another WZ term appears, which remains 
nonzero even in the absence of background fields. This governs scattering processes 
with an odd number of mesons, such as K+ + KT —> mt +27 + 2°, which 
would otherwise be forbidden in ChPT. The mathematical origin of this WZ term 
parallels that of the quasi-invariant Lagrangians with one time derivative, analyzed 
in Sect. 8.1. The Lagrangian density of the WZ term can be mapped to a 4-form 
æwwz such that the 5-form dwwz is chirally invariant and closed but not exact. Such 
5-forms are classified by the fifth de Rham cohomology group of the coset space. 
The coset space [SU(3)L x SU(3)r]/SU(3)y has a unique generator of degree-5 
cohomology, 


deowz, x tr [(U aU) A (UAU A (UAU A (UAU) A (Ud) . 
(9.45) 


The overall normalization is again fixed by matching to the flavor anomalies of 
QCD. More details about the geometric nature of this WZ term and its coupling to 
background gauge fields can be found in the original work of Witten [16]. 
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The two-flavor coset space [SU(2), x SU(2)r]/SU(2)v, being three- 
dimensional, obviously has vanishing fifth cohomology group. However, it does 
have a nontrivial third de Rham cohomology with a single generator, 


wow « tr [(U dU) A (Ud) A (Ud) . (9.46) 


When pulled back to the four-dimensional Minkowski spacetime, wgw is just the 
Hodge dual of the GW current (9.37) in absence of external fields. This explains why 
such an identically conserved current exists in the first place. Moreover, the integral 
of Je over R? defines a conserved charge that is a topological invariant. Upon 
a suitable normalization, it coincides with the Brouwer degree of the pion fields 
viewed as a map S? —> S?; cf. Example A.25. A skyrmion is a configuration of pion 
fields for which the topological charge is nonvanishing. Thanks to the coupling to 
AB in (9.39), the topological charge has the interpretation as baryon number. This 
provides a mathematical foundation for the Skyrme model of baryons. 


9.2 Spin Waves in Ferro- and Antiferromagnets 


I have already used ferromagnets repeatedly to illustrate various features of SSB, 
including the peculiarities of the spectrum of NG bosons in nonrelativistic systems. 
In order to make the present section self-contained, I will however start with a 
concise summary of the basic facts. 

Ferro- and antiferromagnets are phases of matter that exhibit spin order. 
Although such order may also be induced in relativistic matter, I will have implicitly 
in mind its realization in ordinary crystalline solids. The advantage of this restriction 
is that in the nonrelativistic limit, spin can be treated as an internal degree of 
freedom. One can then base the construction of EFT for (anti)ferromagnets on 
spontaneous breakdown of the internal G ~ SU(2) spin symmetry. In both types of 
systems, the unbroken subgroup is H ~ U(1), corresponding to spin rotations about 
the axis of spin alignment. The coset space is therefore G/H ~ SU(2)/U(I) ~ S?. 
From the point of view of low-energy EFT, the only difference between ferro- and 
antiferromagnets is a nonzero VEV of spin in the former. This is directly reflected 
by the spectrum of NG bosons: spin waves, or magnons. In ferromagnets, there is a 
single type-B magnon, whereas antiferromagnets feature two type-A magnons. 

The generators of G ~ SU(2) can be taken as t4/2. Without loss of generality, 
we may choose the spin axes so that H ~ U(1) is generated by 13/2. The coset space 
SU(2)/U(,) is then symmetric thanks to the inner automorphism R(g) = RT! gR 
with g € SU(2) and R = it3. This makes it possible to map the coset representative 
U(x) on a unit-vector variable n() € S? via 


t- n(x) = N(x) = U(r) t = U(m)nU (ay! . (9.47) 
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The matrix field N (zr) transforms linearly in the adjoint representation of G. As a 
consequence, the G-covariant derivative (9.8) boils down to 


Dn(a) = dn(z) + A x n(z), (9.48) 


where A, is a triplet of background gauge fields of SU(2). The covariant derivative 
D,,n(zr) is the basic building block for construction of EFT for (anti)ferromagnets. 

In addition to the internal spin symmetry, I will assume invariance under 
continuous spacetime translations and continuous spatial rotations. This is of 
course just a crude idealization of real materials, where such ideal symmetry may 
be explicitly broken by a variety of perturbations. These include especially the 
anisotropy induced by the underlying crystal lattice, and the effects of spin-orbit 
coupling. I will nevertheless initially assume the ideal, unperturbed limit. An outline 
of some phenomenological consequences of explicit symmetry breaking is deferred 
to Sect. 9.2.3. 


9.2.1 Power Counting and Effective Lagrangian 


The general philosophy of the derivative expansion of the EFT for (anti)ferromagnets 
copies closely that for ChPT, detailed in Sect.9.1.1. However, the two cases 
differ substantially due to the qualitatively different spectra of (anti)ferromagnetic 
magnons. I will start with the more nontrivial, genuinely nonrelativistic case of 
ferromagnets. 


9.2.1.1 Ferromagnets 

The energy of ferromagnetic magnons is quadratic in momentum, at least in the 
long-wavelength limit. In order to assign to a given Feynman diagram a well-defined 
degree, we therefore count momentum as order one and energy as order two. The 
Schrédinger-like propagator of the magnon then has overall degree —2. In close 
parallel with (9.3), the degree of a Feynman diagram I’ becomes 


deg T = (D+ 1)L — 21 + (s + 24) Vs . (9.49) 


s,t 


Here V;; denotes the number of vertices from Zo the part of effective 
Lagrangian with s spatial and t temporal derivatives. As before, the number of 
propagators J can be eliminated via J = L + )?°, , Vs, — 1, which leads to the 
final result 


degr =2+(D— IL + (8 + 2t — 2)Von (9.50) 
s,t 
The LO of the derivative expansion, deg = 2, corresponds to tree-level 


diagrams (L = 0) with all vertices satisfying s + 2t = 2. Thus, the LO effective 
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Lagrangian consists of ge and Aa . Before we construct these, let us briefly 
consider higher orders of the derivative expansion. First, note that unlike in ChPT, 
here we have a well-defined derivative expansion even in D = 2 spacetime 
dimensions. Indeed, ferromagnetic order that can be described by a derivatively 
coupled low-energy EFT exists even in one-dimensional spin chains. This is special 
to type-B NG bosons, as I will show in Sect. 15.2. 

What exactly constitutes the NLO of the derivative expansion depends on the 
number of dimensions. For D = 2 (ferromagnetic chains), the NLO corresponds to 
deg I = 3. It collects contributions from one-loop diagrams with all vertices from 
the LO Lagrangian, and from tree-level diagrams with one vertex from A , if any 
such operators exist. (They may be forbidden by parity.) In D = 4 dimensions (bulk 
ferromagnets), on the other hand, there are two options. In case oo is allowed, 
then tree-level diagrams with one such vertex constitute the sole contribution with 
deg I’ = 3. Otherwise, the NLO corresponds to deg = 4, and consists of tree- 
level diagrams with one vertex from a ZaD, or ZRA, Up to and including 
NLO, there are no quantum corrections; loops only start contributing at degr = 5. 
Perhaps the most interesting is the case of D = 3 (thin ferromagnetic films or 
layers). Barring the possible existence of ZEN, the NLO here is degr = 4. It 
includes both one-loop diagrams with all vertices from the LO Lagrangian and tree- 
level diagrams with one vertex from the NLO Lagrangian. 

Clearly, the setup of the derivative expansion depends very sensitively on the 
specific choice of material (which affects discrete symmetries such as parity) and 
sample (which controls the dimension D). I will therefore limit the discussion to 
the effective Lagrangian at LO. A detailed classification of possible operators up 
to order four in derivatives, including the effects of the discrete crystal and time- 
reversal symmetries, can be found in [17]. 

The oe part of the LO Lagrangian is trivial. According to the general analysis 
in Sect. 8.2, we expect it to be of the type Kqpd"* QS Qb , Where the coupling Kap must 
be invariant under the adjoint action of H ~ U(1). The broken part of the gauged 
MC form (9.7) now takes the specific form 


Qı (7, A) = -LUT DNOU) = EBU DNU) . (9.51) 


The H-invariant part of the symmetric tensor product of 2, with itself is projected 
out by taking the trace. This leads immediately to the Lagrangian 


e 


An -= t[DN Gr) - DN(x)] = — $5" D,n(r) -Dyn(t). (9.52) 


The parameter ọs is called spin stiffness and controls the gradient energy arising 
from “bending” the uniform ground state magnetization. 
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In d = 2 spatial dimensions, it is also possible to construct an invariant 
operator using antisymmetric tensor product, Xap£™ Q4 oe This leads to the 
operator e’*n - (D n x Dsn). In the absence of background fields, the latter 
is a pure surface term; its integral over R? gives, up to normalization, the 
Brouwer degree of the map n : R? —> S*. That is however no longer the case 
when the EFT is coupled to background gauge fields. A quick calculation 
shows that up to surface terms, en - (D-n x Dsn) ~ en - Fps, where 
F uv = 0, Ay—0)A,+Ay, X Ay is the field strength of the background. When 
added to the Lagrangian, this operator will modify the EoM for spin waves; 
cf. the discussion of EoM in Sect. 8.3. In the following, I will nevertheless 
disregard this contribution to the EFT. First, it only exists ind = 2 dimensions 
and moreover violates time reversal, under which n(x,t) — —n(x,—t). 
Second, it requires a specific, nontrivial background to be nonzero, and thus 
does not affect the propagation of free magnons. 


Let us now focus on the A part of the LO Lagrangian. According to the 
general discussion in Sect. 8.2, this reads 


LEY) = -Momi + Mv (a) AG , (9.53) 
where M is the density of spin (magnetization) in the ferromagnetic ground state. 
The second term in (9.53) is seen to equal M Ag - n(x). To evaluate the first term, 
we project out the third component of the MC form by taking trace with t3. Then 


we apply the exponential parameterization, U (r) = exp(iz“t,/2), and use (7.31), 


— Mo} (1)x* = iM tr [t3U (r) U(x) 


M. a j —1 
= = dt tr [c3U (t7) TU (t1)| 
0 
1 1 
= -mż“ f dtng(tm) > mz“ f dT altr), (9.54) 
0 0 


where ~ indicates equality up to a total derivative. The index a runs over 1, 2, we 
can however formally extend z“ to a three-component vector x and write m'ña = 
mx- = (n x m )-(n xn). Using the definition (9.47) to take the derivative 0; N (t7), 
we find that n(tz) x m = 0;n(t7). Putting all the pieces together, we then arrive 
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at the final expression for the LO effective Lagrangian for ferromagnets, 


1 
EAr = mf dt 0,n(tz) -[n(tz) x ùl(tr)]+ M Ao- n(x) 
9 (9.55) 


z S8" Dnr) . Dyn(x) . 


The second term in (9.55) is the usual Zeeman coupling of spin to an external 
magnetic field, whereas the third term represents the gradient energy of the spin 
configuration. The first term, however, deserves a comment, since the presence of 
integration over the parameter t makes it look nonlocal. To get rid of the integral, let 
us take a step back. Consider a one-parameter family of fields, n (t, 7), t € [0, 1], 
such that n (0, 2) = (0, 0, 1) = mo (ground state) and n(1, 7) = n(z). The explicit 
choice of interpolation used in (9.55) corresponds to n(t, 7) = n(t7m). It is easy 
to check that upon a smooth deformation of the field, òn (t, 7), the variation of the 
action only depends on $n(1, 7) = n(x). The concrete choice of interpolation 
between the t = O and t = 1 limits therefore does not matter. Assuming for 
simplicity that n>(x, t) as a function on the spacetime is non-negative everywhere, 
we can change the interpolation to 


n(t,) = (tal (x), tn? (x), V1 — tEn! (xr) + (n20))?)) , (9.56) 
which makes it possible to carry out the integral over T, 


Eabn (A? (71) 


LO _ 
et = 1+73(7) 


e 


+ MAg-n(z) — Sa D,n(r) - Dsn(z). (9.57) 


This is as far as we can get. The Lagrangian is local and manifestly invariant under 
H ~ U(1). Moreover, it only depends on the NG fields x° through n(x), and 
thus does not rely on the exponential parameterization of U (x), originally used to 
derive (9.55). 


9.2.1.2 Antiferromagnets 

As far as the construction of the effective Lagrangian is concerned, antiferromagnets 
are much simpler than ferromagnets. The spectrum consists of two type-A NG 
bosons whose energy is, in the long-wavelength limit, linear in momentum. For the 
sake of power counting, we therefore have to treat spatial and temporal derivatives 
on equal footing. The resulting expression for the degree of a given Feynman 
diagram is a trivial generalization of (9.4) we found in ChPT, 


degr =2+ (D -= 2L +J (s +t- 2)Vs. (9.58) 


St 
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The only difference to ChPT is that spatial and temporal derivatives may enter the 
effective Lagrangian independently. 

Owing to the same symmetry-breaking pattern, the building blocks for construct- 
ing the EFT are the same for ferro- and antiferromagnets. The main difference 
is that the part with a single time derivative, 2 , is missing in the latter; 
antiferromagnets have zero net magnetization. The LO Lagrangian then consists 
of 2 and a, and we can write it down at once, 


GO = 5,3 [Pom (r) - Don(s) — 028" D,n (r) - Dsn()] . (9.59) 
There are two independent parameters which are easy to relate to physical observ- 
ables. The spin stiffness 9, measures the gradient energy of the order parameter, 
whereas v turns out to be the phase velocity of antiferromagnetic magnons. 

Similarly to ferromagnets, the organization of the derivative expansion beyond 
LO depends sensitively on D and the presence of discrete symmetries such as parity 
or time reversal. I will therefore stop the discussion of power counting here, and 
turn to the consequences of the EFT at LO. 


9.2.2 Equation of Motion and Magnon Spectrum 


We already know the number and type of magnons in both ferro- and antiferro- 
magnets. However, the EFT tells us more, in particular what the corresponding 
fluctuations of the order parameter look like. To that end, I will drop the background 
gauge fields and derive the EoM corresponding to the LO effective Lagrangian. 

Let us start with ferromagnets. As hinted above, taking a variation of (9.55) gives 
a surface term in t, which allows one to write the variation of the action solely in 
terms of òn (7), 


SLO = f dx 8n- (Mn x ù + ọsV°n) . (9.60) 


The variation òn is not arbitrary but rather should keep n on the coset space, that is 
the unit sphere S*. In other words, 8” should be a tangent vector to the sphere. The 


vanishing of hig therefore requires that 


Mnxn+ osV7n = An; (9.61) 


à can be interpreted as a Lagrange multiplier for the constraint n -n = 1. We can get 
rid of it by taking a cross product with n, which gives the Landau—Lifshitz equation, 


a= nx Vn. (9.62) 
M 
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Previously, in Sect. 4.3.1, I derived this equation using the Hamiltonian (symplectic) 
formulation of field theory from the postulated Poisson bracket for the spin variable 
n(x); cf. (4.29). The Lagrangian and Hamiltonian descriptions of ferromagnets are 
of course equivalent. For instance, the symplectic 2-form (4.33) can be recovered 
by noting that according to (9.53), the symplectic potential is —Mw*. The exterior 
derivative thereof is easily evaluated using the MC equation (8.11). This shows that 
the fundamental Poisson bracket for n(x) is already automatically built in the low- 
energy Lagrangian EFT for ferromagnets. 

To solve the Landau-—Lifshitz equation (9.62) is a hard problem due to its 
nonlinearity. What one can do easily is to linearize the equation in small fluctuations 
around the ground state. Inserting n = mo + 6n and keeping only terms linear in òn, 
we get 

Os 


sa = — Von. 9.63 
ñ = Mo x n (9.63) 


We can now look for plane-wave solutions by using the ansatz 
n(x, t) = Ae tFtelP* | (9.64) 


where A is a complex amplitude orthogonal to np = (0, 0, 1). Inserting the ansatz 
in (9.63) shows that the energy and momentum satisfy the dispersion relation 


E(p) = Sp , (9.65) 


typical for type-B NG bosons. The amplitude must satisfy the constraint iA = no x 
A. This is solved by any A « (1, —i, 0). Ferromagnetic spin waves are circularly 
polarized in the plane transverse to the direction of the ground state magnetization, 
no, regardless of the direction of momentum p. This can be understood as Larmor 
precession of individual spins around the effective magnetic field generated by the 
spin-polarized background. 

Antiferromagnets can be treated in the same way, without the complications 
due to the single-time-derivative operator in the Lagrangian. The EoM obtained 
from (9.59) can be written as 


& (aBn — v? Y?n) = An, (9.66) 
v 


where A is a Lagrange multiplier. Upon linearization around the ground state, no, 
we find plane-wave solutions of the same general form as in (9.64). However, 
the dispersion relation is now E(p) = v |p|, typical for type-A NG bosons. The 
complex amplitude A must be orthogonal to no. We conclude that antiferromagnetic 
spin waves are also polarized in the plane transverse to the direction of no, regardless 
of the direction of momentum. However, the polarization can be both linear and 
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circular, or in general elliptic. There are therefore two different, independent types 
of antiferromagnetic magnons, which can be chosen to be linearly polarized. 


9.2.3 Effects of Symmetry-Breaking Perturbations 


So far, we have assumed exact symmetry under SU(2) spin rotations, continuous 
spacetime translations and spatial rotations. Let us now briefly consider the effects 
of some phenomenologically important perturbations. These can be classified into 
two broad groups: perturbations controlled by external fields, and those intrinsic to 
the given material. 

The most natural tunable perturbation that the spins in (anti)ferromagnets can be 
exposed to is an external magnetic field, B. Insofar as its effect on orbital degrees 
of freedom can be neglected, the magnetic field couples directly to the conserved 
charge of G ~ SU(2): the total spin. It can therefore be treated as a vector-valued 
chemical potential. In the low-energy EFT, this is implemented by setting A,,(x) = 
ô 0B (x); the magnetic moment of the spins is absorbed into the definition of B. I 
will now show that treating B as a background gauge field allows us to make some 
exact statements about magnon spectrum. Importantly, we do not have to introduce 
any new arbitrary parameters into the Lagrangian. 


Example 9.6 


According to (9.55), the effect of an external magnetic field on ferromagnets is 
taken into account by adding the Zeeman term, MB - n(x), to the Lagrangian. 
As long as the magnetic field is uniform (which I will from now on assume), the 
ground state no will remain uniform as well. However, its orientation is no longer 
arbitrary, but rather has to be aligned parallel to B. The magnetic field selects a 
unique stable equilibrium state. The effect on the magnon spectrum is also easy 
to work out. The left-hand side of the Landau—Lifshitz equation (9.62) has to be 
modified by replacing ñ —> Don = ñ + B x n. Upon linearization, the plane-wave 
magnon solutions are still found to be circularly polarized in the plane transverse to 
no || B. The only effect of the magnetic field is a constant shift of the dispersion 
relation (9.65), 


Qs 2 
E =|B ~ p^. 9.67 
(p) = |B| + m? (9.67) 


The reason why the response of ferromagnets to a uniform magnetic field is so 
simple is that the conserved charge that B couples to remains unbroken. Energy 
levels can therefore be labeled by the projection of spin into the direction of B. The 
excitation energy of a state with spin S (relative to the ground state) will be shifted 
by the magnetic field by —S |B|. The ferromagnetic ground state is maximally 
polarized, and a single magnon carries a unit of spin less than the ground state. This 
explains why the magnon receives a gap equal to |B|. This is an exact result valid to 
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all orders of the derivative expansion; the ferromagnetic magnon is an example of a 
massive NG boson in the sense of Sect. 6.4.3. 


The effect of magnetic fields on antiferromagnets is somewhat less trivial. Accord- 
ing to (9.59), their LO Lagrangian (in absence of other perturbations than B) 
becomes 


LO = 53 {Ion (r) + Bx n(x) — v°8,n(r) - Asn(r)} . (9.68) 


The corresponding Hamiltonian density is 


HEP = 53 {dom ry? — [B x n(x)} + 75% 3 n(m) - Asn(zx)} . (9.69) 
The response of the ground state is quite different from ferromagnets: the energy 
is minimized by any mg L B. Hence the U(1) group of spin rotations around the 
direction of B is spontaneously broken. We expect the spectrum to contain one true 
NG boson, whereas the other magnon should receive a gap from the magnetic field. 

To see this explicitly, let us choose B = (0,0,|B|) and nọ = (1,0,0). We 
can use n? and n? as two independent fluctuations of the order parameter, and 
parameterize the latter as 


n(x) = (V1 = [m2 + nma, 0? (x), n E) . (9.70) 


To the second order in the fluctuations and up to a total time derivative, the 
Lagrangian then reads 


Zee 5 a XO [(on')? — v’ Vn - Vn'] + Bf - Tl +. (971) 


i=2,3 


This makes it clear that n2(x) remains gapless, as expected. On the other hand, the 
n>(x) mode receives a gap, its full dispersion relation being E(p) = yv? p? + B2. 
The conclusion that E (0) = |B| is exact. The spectrum of an ideal antiferromagnet 
in a uniform magnetic field contains one true NG boson and one massive NG boson. 


As opposed to the effect of external fields, perturbations induced by the 
underlying crystal lattice are intrinsic to the given material and therefore cannot 
be “switched off.’ A prominent position among these is occupied by anisotropy in 
either spatial or spin structure of the microscopic interactions. I will content myself 
with the simplest illustrative example of such a crystal anisotropy, whereby one spin 


9.2 Spin Waves in Ferro- and Antiferromagnets 197 


axis is distinguished from the other two, 
Len = EnF. (9.72) 


For € > 0, the perturbation favors spin alignment along the third axis; this kind 
of anisotropy is called easy-axis. In the opposite case of € < 0, spin alignment 
along the first or second axis is preferred. This is an easy-plane anisotropy. The 
effects of easy-axis and easy-plane anisotropy on ferro- and antiferromagnets are 
very different. It is therefore best to discuss them separately. In each individual 
case, I will proceed by first identifying the ground state and then expanding the 
Lagrangian to second order in fluctuations. 


Example 9.8 


Let us start with ferromagnets. In the absence of background gauge fields but 
presence of the anisotropy, the Lagrangian (9.57) becomes 


apb 
Eabh N Qs 85an - dn + e(n’)? : (9.73) 


PO — 
et l+n? 2 


In easy-axis ferromagnets, the ground state is unique up to overall sign, nọ = 
(0, 0, 1). The two independent fluctuations can be taken as n! and n2. Both the 
anisotropy and the ground state preserve the U(1) group of spin rotations around 
the third axis. We can thus look for normal modes as eigenstates of this symmetry. 
This motivates the introduction of a complex field, y = (n! + in?)/./2. Dropping 
the energy density of the ground state and expanding the Lagrangian to second order 
in y, we get 

LIO ~ iM dow — oV yt -Vy —2ew Ww H- . (9.74) 
This leads to a Schrédinger-like equation, describing circularly polarized spin waves 
with dispersion relation 


Qs 2 
~ p^. 9.75 
uP (9.75) 


E 2€ 
=F, + 
The anisotropy gives the magnon a gap since both generators of SU(2), sponta- 
neously broken in the preferred ground state, are also broken explicitly. In contrast 
to (9.67), the gap predicted by (9.75), E(0) = 2€/M, is only a LO result and will 
receive corrections at higher orders of the derivative expansion. The same is true for 
all the other magnon dispersion relations, derived below. The magnon has become 
a pseudo-NG boson. 

In the easy-plane case, any uniform state with (n?) = 0 minimizes the energy. I 
will choose the ground state as no = (1, 0, 0) and the independent fluctuations as 
n? and n°. Upon series expansion in the latter using the parameterization (9.70), the 
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Lagrangian becomes 

Leip ~ Mn? don? = YO Vni- Vn = lel) +. (9.76) 

i=2,3 


We already found the spectrum of this type of Lagrangian back in Sect. 3.2.1, I can 
therefore just write down the final result, 


2 lel 
E(p)= s Ipl,| p? + n (9.77) 
S 


The dispersion relation remains gapless but has become linear. This is because the 
U(1) group of spin rotations, left intact by the anisotropy, is now spontaneously 
broken. The anisotropy has turned the magnon into a type-A NG boson. 


Next we turn to antiferromagnets, which are now for a change much easier to 
analyze. In the absence of background gauge fields but upon adding the anisotropy 
term, the Lagrangian (9.59) turns into 


LIP = 523 [om — v8" a,n - Asn] + (n°)? . (9.78) 
This Lagrangian is diagonal in the spin index of n’. We can therefore read off the 
spectrum immediately upon identification of the ground state and its independent 


fluctuations. In easy-axis antiferromagnets, the ground state is ng = (0, 0, 1) up to 
a sign, and its fluctuations are n! and n*. These excite two gapped magnons, 


2v2e 


E1 2(p) = ,| V p? + (9.79) 


S 


We can choose the basis of independent spin waves freely, either as linear, circular, 
or generally elliptic. In the easy-plane case, on the other hand, we can choose the 
ground state as nọ = (1, 0, 0) and its fluctuations as n? and n?. Here we find two 
different excitation branches with different dispersion relations, corresponding to 
linearly polarized spin waves, 


2v2 |e 
Ex(p)=v|p| , E3(p) = |V p’ + m (9.80) 
S 
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The reason why one of the excitations remains gapless is that there is an exact U(1) 
symmetry, left intact by the perturbation, that is spontaneously broken. 


I will conclude the discussion of perturbations in spin systems with a very 
peculiar example that leads to fascinating phenomenology. Until now, I have ruled 
out the existence of god based on invariance under spatial rotations. However, 
one can, in fact, construct an operator with a single spatial derivative that does 
not break rotations, as long as two conditions are satisfied. First, parity must be 
broken, typically by the structure of the underlying crystal lattice. Second, spin— 
orbit coupling must be taken into account. This breaks the separate symmetries 
under spatial (orbital) and spin rotations to a single SU(2), under which spatial 
coordinates x and the spin vector n transform simultaneously. One can then add to 
the Lagrangian of (anti)ferromagnets the Dzyaloshinskii-Moriya (DM) term, 


27 Os 
LEM =n (V xn), (9.81) 


where Apm is a new parameter with the dimension of length. This length scale 
is fixed by the choice of concrete material, and is usually much larger than the 
scale of the underlying crystal lattice. For instance, in MnSi one finds Apm ~ 
18nm and in FeGe Apm ~*~ 70nm [18]. This justifies treating the perturbation 
coupling 277@;/Apm as a small parameter of degree one in the power counting. The 
Lagrangian a can then be consistently included in the LO of the derivative 
expansion. 


Example 9.10 


The ground state induced by the DM interaction can be discussed jointly for ferro- 
and antiferromagnets if one restricts to time-independent spin configurations. In the 
absence of external gauge fields and anisotropy, the LO Hamiltonian then reduces 
to 


27 Os 
ADM 


HOO = Saran -ðn + n-(V xn). (9.82) 


The condition for minimum energy follows by “completing the square,” 
; 27 Os 
HOD ~ SAV on)? + (V x n)?] + nV xn) 
2 ADM 


9.83 
TE (9.83) 


; 2 2 
-$V +E (vxn4 Za) J 
2 2 ADM DM 
The second term will be minimized by any spin configuration satisfying the first- 
order differential equation V x n = —(27/Apm)n. This guarantees without 
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further conditions minimization of the first term, (V - 2), and thus of the whole 
Hamiltonian. 

Let us now temporarily forget that n(x) should be a unit vector at any x, and 
Fourier-transform it to momentum space. The complex amplitude np of the plane 
wave with momentum p is then subject to the condition ip x np = —(27/Apm)Np. 
This requires that p L np and |p| = 27/Apm for any p such that np is 
nonzero. This gives the parameter Apy the interpretation as the wavelength of a 
spatially inhomogeneous order, induced by the DM interaction. An arbitrary linear 
combination of plane waves with fixed | p| is, however, not allowed by the constraint 
n-n = |. Without going into further technical details, let me spell out the final result. 
The ground state corresponds to a real helix that is right-handed for Apyy > 0 and 
left-handed for Apm < 0. The direction of the axis of the helix is arbitrary and 
spontaneously breaks the symmetry under spatial and spin rotations. Choosing it for 
illustration along the z-axis, and n to point along the x-axis at z = 0, the ground 
state becomes 


(n(x)) = (cos 2xz/àpm, sin 27 z/ApM, 0) . (9.84) 


In presence of the DM interaction, the spin system becomes a helimagnet. 

The magnon spectrum of helimagnets is no less fascinating than their ground 
state. In case the underlying spin order is ferromagnetic, there is a single magnon 
branch (cf. Example 6.5). It has an unusual, strongly anisotropic dispersion relation. 
Along the axis of the helix, the energy is linear in momentum for wavelengths much 
longer than Apm, whereas in the two transverse directions it is quadratic [19]. One 
can show in addition that in the former case, the spin wave is linearly polarized, 
while in the latter case it is polarized circularly as usual in ferromagnets. 


9.2.4 Some Topological Aspects of Ferromagnets 


The list of interesting features of spin systems does not end with the multitude of 
textures that can be induced by various perturbations. In particular ferromagnets 
exhibit a number of intriguing topological properties that can be traced to the 
part Eas of the effective Lagrangian. A brief sample is presented below. A 
reader wishing to learn more about the topology of spin systems is encouraged to 
consult [20]. 

Let us start by rewriting the single-time-derivative part of the ferromagnet 
Lagrangian (9.55) in a way that exposes its geometric nature. Suppose the field 
n(x,t) converges to a constant for £ — +00; this is a usual assumption when 
setting up the variational principle for fields. Viewed as a function of t and t, n then 
maps [0, 1] x R to a “disk” D on S? whose boundary I carries the physical field 
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Fig. 9.3 Geometric 
interpretation of the 
single-time-derivative term in 
the effective 

Lagrangian (9.55) for 
ferromagnets. The spin 
configuration n(x, t) maps 
the time axis to a closed curve 
T on the coset space S?. 
Manifest invariance of the 
Lagrangian under SU(2) spin 
rotations can be saved at the 
cost of extending the 
integration from T to the disk 
D (shaded area) whose 
boundary is T 


(t = 1). The action defined by (9.55) can be cast as 


Ship {n, A} = m [ a f Qin](x, t,t) +e, (9.85) 
D 


where Q[n] = n- (0;n x 0,n)dt A dt; the ellipsis stands for the spacetime integral 
of the second and third term in (9.55). Note that n - (dn x dn) is the area form on 
S?. The action is therefore determined geometrically by the area of the domain on 
S?, bounded by the curve I’; see Fig. 9.3 for a visualization. 

But how do we know which domain? I showed already that smoothly varying the 
interpolation m(t, 7) between the fixed limits at t = 0 and t = 1 does not change 
the action. However, there are distinct classes of maps that cannot be smoothly 
deformed into each other. We could, for instance, think of the interpolation as 
filling the domain D shown by shading in Fig. 9.3, or its complement on the sphere. 
There is no a priori way to distinguish the “inside” and “outside” of the curve I. 
The interpolation n(t, 2) could even cover the whole sphere multiple times, before 
converging to the curve I" in the limit rt — 1. The only way around this intrinsic 
ambiguity is to ensure that it is not physically observable. 

To that end, consider two domains, Dı and D2, swept by two different interpo- 
lations nı (t, 7) and n2(t, 7) of the same physical field, nı (1, x) = n2(1, x) = 
n(zr). We can glue the two maps into a single one, A(t, 7), T € [0, 2], by setting 


A(t, 7) =n (tT, 7) for t € [0, 1], 
(9.86) 
A(T, m) =n2(2—-T,7) fort € [1,2]. 
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This new map satisfies n(0, 7) = n(2, 7) = no. Thanks to the compactification of 
the time axis to the circle S1, we can think of it as a map S? —> S? with the domain 
spanned by the variables t,t. The points t = O and t = 2 are the poles of the 
domain and t = 1 is the equator. As a consequence, 


f Q[n] -f Q[n2] = 4r w[i] , (9.87) 
Dı Dı 


where w[ñ] is the integer-valued Brouwer degree (A.145) of ñ. The conclusion 
is that the classical action of a ferromagnet suffers from a topological ambiguity, 
shifting it by 4m MV w[ñ] where V is spatial volume. The functional integral of 
the EFT as a quantum theory will still be well-defined provided the action is only 
ambiguous up to an integer multiple of 277. This requires that the total spin MV be 
quantized in half-integers. The low-energy EFT for ferromagnets, constructed solely 
based on the geometry of the coset space, “knows” about quantization of spin! 

Now that we have established the topological nature of the 2, ae Lagrangian, 
let us look at some of its consequences. The most immediate one is the presence 
of a Berry phase; see Sect. 1.5 of [21] for a general introduction and [22] for the 
application to EFTs for NG bosons. Let us denote the quantum-mechanical ground 
state of a ferromagnet, corresponding to the order parameter (n) = no, as |Mo). 
Suppose we expose the ferromagnet to a weak, uniform magnetic field B. This 
forces it to align its magnetization with the field so that (n) = n(B) = B/ |B|, with 
the corresponding vacuum state |n(B)). 

Consider now a time-dependent magnetic field B(t) and arrange the ferromagnet 
state |y(0)) at t = O to be the vacuum state |n(B(O))). If the variation of B(t) 
with time is sufficiently slow, the evolution of the state of the system |y(t)) will be 
adiabatic. We can set the ground state energy E to zero to eliminate the trivial phase 
factor of e~!“". Even then, we cannot expect |y (t)) to equal |n (B (t))). Rather, 


IY (t)) = expliyg()] Ia (B @))) , (9.88) 


where yg (t) is the Berry phase. Taking the time derivative of (9.88) and projecting 
the result to |W(t)) leads to yg(t) = i(n(B(t))|d9|\n(B(t))). The total phase 
accumulated over an interval [f1, t2], i yB (t) dt, only depends on the path followed 
by B, not on its precise time dependence. To underline the geometric nature of the 
Berry phase, we introduce the Berry connection, 


op(B) = 1(n(B)|d|n(B)) , (9.89) 


which is a 1-form on the space of all quantum ground states of the ferromagnet. The 
gauge freedom associated with the Berry connection amounts to the arbitrary choice 
of phase of |n(B)) for every value of B. Since the state |n(B)) only depends on the 
direction of B, the Berry connection naturally induces a 1-form on the coset space 
S? of the ferromagnet. When the magnetic field varies so that it traces a closed loop 
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in the parameter space, the total Berry phase is 
yp (1) = i WB a dwg . (9.90) 
r D 


Here T is the loop on S? traced by the order parameter in the process and D the disk 
bounded by it, as in Fig. 9.3. The 2-form da is called the Berry curvature. 

To see the connection between the Berry phase and the effective Lagrangian 
for ferromagnets, recall the parameterization of the coset space by matrices U (zr). 
With a slight abuse of notation, one can write |n(B)) = U(2(B)) |no). The Berry 
connection then reads 


wp = i (no|U (r) 'dU (at) |n0) = — (nolo (zr) m0) = -MV ar (7) . (9.91) 


Comparing this to (9.53), we see that up to the factor of volume, the Berry 
connection is precisely the 1-form that defines the Lagrangian density wee, On 
the other hand, it follows from (9.85) and the Stokes theorem that —dw® is the area 
form on S?. The Berry phase (9.90) therefore equals the total spin of the system 
times the area of D. 

Mathematically, the area 2-form is the single generator of the second de Rham 
cohomology group of S*. The topological features of we D stem from pulling this 
2-form back to the space of variables t, t. However, interesting physics also arises 
from pulling it back to the whole spacetime. For simplicity, I will now restrict to 
d = 2 spatial dimensions. The resulting closed spacetime 2-form is then Hodge- 
dual to a current, conventionally normalized as 


1 Be Hee ail 
J“[n] = ae euen dyn agnk . (9.92) 


This current is conserved off-shell, similarly to the GW current introduced in 
Sect.9.1.4. Following the analogy, we expect the density J°[n] to give rise to 
a topological charge, that is a topological invariant of the field n(x, t). Indeed, 
[alex J°[n](x, t) is just the Brouwer degree w[n] of the map n : R? > S°. In 
d = 2 dimensions, there are smooth spin configurations carrying nonzero value 
of w[n], called skyrmions, or sometimes baby skyrmions, to distinguish them from 
their counterparts in QCD. See the dedicated monograph [20] for further details 
about this fascinating subject. 

Recall now the symplectic approach to ferromagnets, outlined in Sect. 4.3.1. It 
is easy to check explicitly that as a functional on the phase space of a ferromagnet, 
w[n] has a vanishing Poisson bracket with any other functional. This is an example 
of a general feature that topological charges do not generate any flow on the phase 
space. Let us see what happens if we deform w[m] by inserting an arbitrary function 
of spatial coordinates. Changing for convenience the overall normalization, we 
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define a class of functionals, 


_M 2 rs i J k 
Ostul= = f ax fae erjen (x)d-ni (x)asn* (x) . (9.93) 


It is a simple exercise to verify that the Poisson algebra of these functionals 
reproduces the algebra of functions on R?, 


{Q, Qc} = Qersa, fag = Qt fg} - (9.94) 


In addition, the functional Q f generates a flow on the phase space through 


{n(x), Oy[n]} = ed, f(x)dsn(x) . (9.95) 


Curiously, this makes it possible to identify P” = Qs,„xs with the generator of 
spatial translations, that is momentum. The real surprise comes now: according 
to (9.94), 


{P" [n], P [n]; = 427 Me™ w[n]. (9.96) 


For skyrmions, which carry nonzero w[n], the two components of momentum do 
not commute with each other. This has a simple physical interpretation. Namely, for 
nonzero w[n], 


eS P,{n] 
4rMwjn] 
(9.97) 


1 , , 
x[n] = go | Eeer a An") = 


represents the center of the topological charge distribution. Conservation of momen- 
tum in the absence of external forces then implies conservation of the center of 
charge. Skyrmions are pinned to a fixed position unless an external field is applied 
to them. The fact that {x7 [n], x},[n]} = £" / (47x M w[n]) hints that the dynamics of 
skyrmions closely resembles that of a charged particle in a magnetic field. 
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| Check for 
updates 


In our survey of spontaneous symmetry breaking (SSB), we have so far been mostly 
concerned with the ground state and excitation spectrum. Yet, the effective field 
theory (EFT) framework developed in Chap. 8 captures full nonlinear dependence 
of the action on the Nambu—Goldstone (NG) fields. It thus also includes all 
interactions among NG bosons and their interactions with other excitations, if 
present. Microscopic interactions in quantum systems may have many different 
macroscopic manifestations. However, there is an important class of observables 
that are amenable to both systematic computation and experiment: scattering 
amplitudes of free asymptotic states. I will therefore conclude the part of the 
book devoted to spontaneous breaking of internal symmetry with a short primer 
on scattering of NG bosons. 

I will start in Sect. 10.1 by revisiting the Adler zero principle that scattering 
amplitudes of NG bosons tend to vanish in the long-wavelength limit. I will offer a 
general justification of this phenomenon, and underline the nature of exceptions 
to the rule. A modern approach to scattering of NG bosons that utilizes the 
geometry of the coset space of broken symmetry is introduced in Sect. 10.2. In 
Sect. 10.3, I demonstrate that there are exceptional EFTs whose long-wavelength 
scattering amplitudes are even softer that naively expected from the Adler zero 
principle. These theories set a benchmark for many modern developments in the 
rapidly expanding field of scattering amplitudes. One important line of development 
concerns the recursive reconstruction of all (tree-level) scattering amplitudes from 
a finite number of “seed” amplitudes. This is the subject of Sect. 10.4. 

Scattering amplitudes as observables are arguably more important in fundamen- 
tal (subatomic) physics than in other branches such as condensed-matter physics. I 
will therefore restrict the discussion to Lorentz-invariant systems. Here NG bosons 
are massless particles and the kinematics of their scattering is well-understood. 
Much of the material of this chapter can however be generalized to nonrelativistic 
EFTs with continuous spatial rotation symmetry; see [1,2] for details. Let me 
stress that even amplitudes in relativistic systems have grown into a vast subject, 
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of which this chapter only exposes a small corner. A reader interested in a broader 
introduction to the physics of scattering amplitudes is recommended to start with 
the lecture notes [3] or the book [4]. 


10.1 Adler Zero Revisited 


One of the hallmarks of SSB is the existence of a local conserved current J” that 
couples to the NG boson state; cf. the proof of Goldstone’s theorem in Sect. 6.2.1. 
This can be used to extract nonperturbative information about scattering processes 
involving one or more NG bosons. We start with a generic process whose initial 
and final states |œ) and |£) may include any set of particles of arbitrary mass and 
spin, and inspect the matrix element (8|J“(x)|a@). This receives two qualitatively 
different types of contributions, schematically shown in Fig. 10.1. We shall focus on 
the first contribution, which amounts to the creation or annihilation of a one-particle 
NG boson state by the current. 

The NG boson may be an elementary excitation of a field in the action. 
The contributions with an intermediate one-particle NG state then correspond to 
one-particle-reducible diagrams where cutting a single propagator disconnects the 
current from the a — f process. However, the NG boson may also be composite. 
The NG state then reveals itself through a single-particle pole in the matrix element 


(B|J“(x)|a) at energy-momentum p” = p% — ph. Contributions to the matrix 
element without such a single-particle pole are represented by the second diagram 
in Fig. 10.1. 


We use the translation property of the current operator, J# (x) = e? * J#(O)e7!?*, 
where P* is the operator of energy-momentum. This holds whenever the current 
does not depend explicitly on spacetime coordinates, which is the case for internal 
symmetry. According to the rules of polology (Sect. 19.2 of [5]), the residue of 
(B| J“(x)|a) at the one-particle NG pole is then e~!?** (0| J“ (0)|(p)) Aa p+x(p): 
Here Ag—f+2(p) is the invariant amplitude for the process where a NG boson of 


pI“ ae 
I 
l 
l 


Fig. 10.1 Sketch of contributions to the matrix element (£| J” (x)|æ). In the former, the current 
creates an intermediate one-particle NG state |2(p)) (dashed line) that couples to |æ) and |8) 
through the amplitude Ay_, g47(p)- In the latter, the current connects to |æ}, |8) via a multiparticle 
intermediate state 
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momentum p is added to the final state.! With a Lorentz-invariant vacuum |0), 
the matrix element (0|J/“(0)|(p)) transforms as a Lorentz vector, as one readily 
verifies using the properties of single-particle states (Sect. 2.5 of [6]). Lorentz 
invariance therefore dictates that (O|J"(0)|z(p)) = i Pon F , where F is a constant 
and pb, = (|p|, p) is an on-shell energy-momentum with spatial part p. The 
matrix element (6|J“(0)|«) can now be split into a pole part and a regular part as 


1 
(BI Oa) = (0| J" OE) pate Btn) + Rea (P) > (10.1) 


where (0| J“ (0) | (p)) or = ip” F. The as yet unspecified function Re a (P) aaa 
all the nonpole contributions, and thus remains regular in the on-shell iit p> > 0. 


Polology suggests that the numerator of the pole in (10.1) should be pro- 
portional to (0| J” (0)|z(p)). Replacing this with (0| J” (O)|z (p)) o¢¢ amounts 
to trading pón for p“, and in turn to a redefinition of Rea (p) without 
changing the residue at the pole. The reason for doing so will become clear 
below when we work out an explicit example. Here let me just stress that 
(0| J* (O)|z¢ (p)) of iS a mere symbolic notation for ip” F. This object is not a 
well-defined matrix element but rather depends on the energy—momenta py 
and pg. 


We now combine (10.1) with current conservation expressed as p, (B|J"(0)|a) = 
0. This leads to a master identity which allows one to reconstruct the on-shell 
scattering amplitude from the remainder function R% (P), 


i H 
T Pu RgalP) - (10.2) 


Aa p+x(p) = F 


Example 10.1 


Recall the toy model (2.1), introduced in Chap. 2. To simplify the notation, I will 
only consider the limit of exact symmetry, € — 0, and disregard the fermionic 


1 Tn contrast to Chap. 6, I use here the normalization (x (p)|7(q)) = (27 )42 |p| 54 (p — q) of the 
one-particle NG states. The amplitude Ay_,g for a generic process œ — £ is normalized by its 
relation to the S-matrix, Sy+p = —(27)P ig po? (Pa — pp). Finally, adding the NG boson to 
|B) rather than |œ) is a convention, amounting to the choice of orientation of p”. 
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degrees of freedom. In the linear parameterization of the complex field, ¢(x) = 
e? [v + x(x) +ixr (x)]/V2, the axial current in (2.3) becomes 


JË = —2i(¢*0"b — aM") = 2v3” r + 2x d"a — 1d" x) , (10.3) 


whence we read off F = —2v. Let us have a look at the process xz — xz in which 
a NG boson scatters off a massive (Higgs) particle. We a læ) = |x (pı)x(p2)) and 
|B) = |x (p3)). The “missing” energy-momentum ph is supplied by the current. 
Using the Feynman rules (2.15), the remainder function Ri x (p4) collecting nonpole 
contributions to (8| J” 'a (O)|@) turns out to be 


wo ed YY 


ay 2P3 + pa)" 


2p; — pa)” 
_ Diav (2p, — pa) 
S 


-2 
(p4 = 2p2)! + 4iav 
t-m u 


(10.4) 


where the solid squares indicate an insertion of the axial current operator. Contract- 
ing this with ph gives 


ipa, R” (p ) = Aao 1+ (-+—3+7)| (10.5) 
Bes: *\s t-m? f ` 
Using finally (10.2), this reproduces the previously calculated amplitude (2.20). 
Note that when evaluating the remainder function Ri _(P4), I ignored the 
contributions of Feynman diagrams where an external 2 ja leg terminates in 
the current operator. These arise from the part of the current linear in x, and are 
included in the pole part of (10.1). Had we used therein the on-shell matrix element 
(O| J“ (0)|z(p)), we would have had to include in (10.4) the contributions of the 
mentioned diagrams with the pole canceled by the numerator factor p” — phn. At 
the same time, the right-hand side of (10.2) would receive an additional factor of 2, 
owing to the fact that the on-shell limit of p - Pon/ p° is 1/2. The conclusion that the 
amplitude can be reconstructed from Ry (P4) would however remain. 
It is instructive to reproduce the same result also with the other parameterization 
considered in Chap. 2, d(x) = el? exp[iz(x)/v][v + x(x) /V2. In this case, the 
axial current reads 


2 
JË = 2vd"n + 4yd"n + =x ax , (10.6) 
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whereas the Feynman rules change to (2.28). The presence of the cubic term in the 
current adds another Feynman diagram to the expression for the remainder function, 


Req (P4) = \ uf 4 DE $ / 4 VY 


i 2 a oH : 2 +H 
Aj s—m 241Avp 4ju—m 4ip 
=— * (p3 + p4)” = “(pi = p4)” Z, 
f= my ¥ u 
(10.7) 


This reproduces the same scattering amplitude, in the form shown in (2.30). 


What is most interesting about (10.2) is the extra factor of p, on the right-hand 
side, arising from current conservation. This shows that 


lim Ag+ pir(p) =9, (10.8) 
p-0 


provided the remainder function is nonsingular in this limit. This is the Adler zero. 
The same conclusion applies to processes with a NG boson inserted in the initial 
state. This distinction is trivial in Lorentz-invariant systems; a pedagogical proof 
of (10.8) for systems with mere spatial rotation invariance can be found in Sect. 3.1 
of [1]. I will not reproduce the details here since they have no bearing on the result. 
The function Roa (p) is by construction nonsingular in the on-shell limit p? — 0. 
The absence of singularities for p — 0 (or equivalently p” — 0) is an additional 
assumption. Finding a mechanism that leads to a violation of this assumption is 
required for understanding the origin of exceptions to the Adler zero principle. 


10.1.1 Generalized Soft Theorem 


Before we embark on a more detailed analysis of the conditions under which 
the Adler zero (10.8) is realized, let me introduce some terminology. Taking the 
momentum of a massless particle (a NG boson or, for instance, a gauge boson) to 
zero is generally referred to as the (single) soft limit. The statement of Adler zero is 
equivalent to the vanishing of the soft limit of scattering amplitudes of NG bosons. 
A universal statement about the soft limit of scattering amplitudes such as (10.8) 
is called a soft theorem. We will see that even in theories where the soft limit of 
A> B-+n(p) iS nonvanishing, one can still establish a generalized soft theorem. This 
relates the soft limit of A«—g+r(p) to the amplitude Ag_,g with the NG boson 
removed. 
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We noticed in Chap. 2 that scattering the NG boson off the fermion within our 
toy model violates the Adler zero principle. Operationally, this could be understood 
as a consequence of the singularity of the fermion propagator in (2.21) in the soft 
limit for the NG boson. This singularity in turn arises from the cubic coupling 
between the NG boson and the fermion. The origin of the singularity is however 
purely kinematical and not specific to coupling of NG bosons to fermions; the same 
obstruction to the Adler zero may occur in purely bosonic theories. It is, in fact, 
commonplace in EFTs where all degrees of freedom are NG bosons, as constructed 
in Chap. 8. The Adler zero is therefore a much less robust consequence of SSB than, 
say, the very existence of gapless NG bosons. 

Let us now examine the hypothesis that the violation of the Adler zero principle 
arises from the presence of cubic interaction vertices in the theory. To keep things 
simple, I will from now on restrict to relativistic EFTs of NG bosons, disregarding 
any non-NG degrees of freedom. The argument below closely follows [7]. 

Consider a set of NG fields x“ in a theory defined by the generic Lagrangian 


1 1 , 
L= 5 bab dy" ax” + x habe dpm" nn" + O(r*, a4). (10.9) 


The coupling abc can be assumed to be symmetric in its a, b indices. There are 
no terms without derivatives as guaranteed by the broken symmetry. Also, we can 
restrict the discussion to operators with two derivatives in line with the power 
counting laid out in Sect. 9.1.1. Finally, I assume that the kinetic term is diagonal 
in the NG flavor indices and properly normalized. This can always be ensured by a 
linear transformation of the NG fields. Other than that, however, I do not implicitly 
assume any particular parameterization of the fields. 


The first thing to notice is that the cubic vertex in (10.9) alone cannot be 
the sole culprit in violating the Adler zero principle. Observable properties of 
scattering amplitudes must be unchanged by a field redefinition of the type 


1 
nw? > nr) =n? + zber n" +0?) , (10.10) 


where che is symmetric in b, c. In particular, choosing c} c= (/ 2574 (Adbe + 
Adcb — Abca) eliminates the cubic interaction vertex altogether. In fact, even 
cubic operators with more than two derivatives can be removed analogously. 
This redundancy of cubic couplings of NG bosons is inherent to relativistic 
EFTs for SSB. 


For the Lagrangian (10.9) to actually represent an EFT for NG bosons, it must 
preserve a symmetry that is spontaneously broken. We do not need to invoke the 
full machinery of nonlinear realizations at this stage. It is sufficient to assume the 
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existence of a set of spontaneously broken symmetries, one for each NG field, 
òn" = [F4 + G4 no + O’). (10.11) 


The independence of these transformations and the fact that they are all sponta- 
neously broken amount to the condition that Fy’ is an invertible square matrix. The 
assumed invariance of (10.9) then leads to the following set of Noether currents, 


JË = bye FEIÄ? + Kapd n!e +O?) , (10.12) 
where Kabe = Sba G2, + FO) ie: 
Let us now evaluate, schematically, the amplitude Ag. g+47«(p) for a process 


where |@) ,|8) are states with an arbitrary number of NG bosons. The soft NG 
boson state |72“(p)) is added to the final state |8). Then (10.1) is replaced with 


1 
(BIJE Ola) = $ (Ol J OT” (P)) oft “2 Fhe pnrip) + Rha?) (10.13) 


b 
From (10.12) we find that (0| J¥ (O |z? (p)) = —iph F2. Applying momentum 
conservation leads to `; F? Ay p4n(p) = —ipy Réoa (p). We expect the Adler 


zero property of Ay_,g4(p) to be violated if the limit p — 0 makes some 
of the propagators in the process on-shell. Barring fine tuning of the kinematics, 
this happens when the NG state |x? (p)) is “attached” to an external leg of the 
a — ß process via a cubic interaction vertex. In terms of Rea a(p), such singular 
contributions map to insertions of the bilinear part of J} into one of the external 
legs of a — £. See Fig. 10.2 for a visualization and the index notation. 


Fig. 10.2 Sketch of nonpole contributions to (B|J/‘(0)|w) that are singular in the limit 
p” = pk — ph — 0. These contributions arise from inserting the bilinear part of the Noether 
current (10.12) into an external leg of the a — £ process. The insertion is indicated by the solid 
square 
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Near the limit of vanishing p”, inserting the current into an external leg with 
flavor b, carrying energy-momentum ph, gives the singular contributions 


i 


Riva (P) D X (IAB) [ -iKaveph +iKacb(pPo — p)"] 
c 


(Pb — p}? 
a i b>c Ph 
on 2 Le P- De (Kabe — Kacb) . 


(10.14) 


The superscript b — c indicates a replacement of the flavor index on the leg where 
the insertion has been made. Upon summing over all possible insertions, we arrive 
at the final form of our generalized soft theorem, 


; 1 5 
lim > Fy Ags peut = 5 NO YO (Kate — Kach) EZG - (10.15) 
b beaUB c 


Let me spell out some properties of this result explicitly. First, the soft theo- 
rem (10.15) relates the soft limit of an n-particle amplitude to a set of (n—1)-particle 
amplitudes, in which the soft NG leg is removed and the label on one other leg is 
changed. The only other ingredient needed is the set of algebraic coefficients Kabc, 
which is independent of the concrete scattering process considered. In this sense, 
the soft theorem (10.15) is universal. 

Second, as is evident from (10.12), the coefficients Kapc receive two types of 
contributions: one from the cubic coupling Ago, the other from the part G4, of the 
symmetry transformation (10.11), linear in NG fields. Importantly, the combination 
Kabc — Kacb, antisymmetric under the exchange b <> c, is manifestly invariant 
under the field redefinition (10.10). This ensures that the soft theorem (10.15) has a 
physical meaning independent of the choice of field parameterization. 


10.1.2 Application to Coset Effective Theories 


The above argument is fairly general and in principle also applies to scattering 
amplitudes in theories where some of the particles in |œ} and |£} are not NG bosons. 
The only essential constraint is that there are no cubic operators without derivatives 
in the Lagrangian. The sum in (10.15) is then to be restricted to c such that 2° 
has the same mass as zr”. However, once the Lagrangian (10.9) does represent an 
EFT with only NG boson degrees of freedom, we can use the wealth of information 
accumulated in Chaps. 7 and 8. 
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Let us recall some of the basic relations that we are going to use here. First, the 
two-derivative part of the effective Lagrangian for the coset space G/H reads 


1 
z= z edog mo (Oy IM? f (10.16) 


where the constant symmetric matrix Kap satisfies the constraint ff, Keb + FepKac = 
0. The index « labels generators of the unbroken subgroup H. With the exponential 
parameterization of the coset space, U(x) = exp(iw“Q,), the Maurer—Cartan 
(MC) form can be computed explicitly as a power series in the NG fields, 


1 
of (n) = ôf — : An? + O(n") , (10.17) 
where the index A runs over all generators of G. Likewise, infinitesimal transfor- 
mations induced on the coset space by G, 87 = Age (7), are determined by a set 
of Killing vectors, 


E4 (7) = 64 — (ri - 58553) n? + O(n?) . (10.18) 


To make the bilinear (kinetic) part of the Lagrangian (10.16) canonically normal- 
ized, one can set Kap = dap by a suitable linear transformation of the NG fields. 
To preserve the form of the exponential parameterization and hence of (10.17) 
and (10.18), this has to be compensated by a linear transformation of the basis of 
broken generators Q,. That in turn affects the values of the structure constants. 
Below, I will assume that such a choice of basis of broken generators has already 
been made. 

With all these pieces at hand, it is easy to construct the Noether currents for the 
spontaneously broken generators of G, 


(2) 
H OL rf 


b 
" T (zr) 


(10.19) 
1 
= ôapð TË — (af bs su fs) ann + O(n) . 


By matching this to (10.12), we can read off the algebraic coefficients needed in the 
soft theorem (10.15), namely F? = 8L and 


Kabe — Kacb = bca f5, m. Spa fL = baa fE . (10.20) 
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Example 10.2 


Let G be compact and semisimple. Using a faithful matrix representation of G, one 
can define the Cartan—Killing form on the Lie algebra g as Aag = tr(Q4 QB). The 
cyclicity of trace, tr([Q4, Qg]Qc) = tr(Q4[Qp, Qc]), implies the identity 


fes ve + fepAav =0. (10.21) 


This encodes the invariance of the Cartan—Killing form under the adjoint action 
of G on g. At the same time, (10.21) ensures that the covariant structure constant 
faBc = ^A D ffc is fully antisymmetric in its three indices. 

It is convenient to choose the basis of broken generators Qa so that the space 
g/b is “orthogonal” to h, that is, Aag = 0. The unbroken and broken indices can 
then be raised and lowered independently using Agg and Aap, respectively. The 
constraint ff, Keb + f4pKac = 0 can be rewritten as K^, ffp — fU.K5, = 0. Thus, 
the matrix «“, commutes with the representation of H on the space g/b. Suppose in 
particular that this representation is irreducible, that is, all the NG modes of G/H 
span a single irreducible multiplet of H. This is the case for a number of physically 
important coset spaces such as SO(n + 1)/SO(n) ~ S”, ISO(n)/SO(n) ~ R”, or 
GL x Gr/Gy with simple G. Then by Schur’s lemma, «“, must be proportional 
to 5p and thus Kap to Aap. The choice kap = ĉap here corresponds to the common 
normalization of symmetry generators such that tr(Q4 Qg) x d,pz. Finally, (10.20) 
reduces to 


Kabe = Kacb x feat = fbac _ fabe = fabe : (10.22) 


In this special case, the right-hand side of (10.15) becomes (1/2) X > favo AZG 
up to an overall factor. beaUp c 


It is obvious from (10.15) and (10.20) that a sufficient condition for the 
presence of Adler zero is that the coset space G/H is symmetric so that ffp = 
0. In fact, for symmetric coset spaces, the reason for vanishing of the right- 
hand side of (10.15) is twofold. Namely, in the exponential parameterization, the 
automorphism R(U (x)) = U(s)~! is equivalent to the inversion m! —> —r%. 
As a consequence, the components of the broken part of the MC form, wœ% (r), 
are manifestly even functions of the NG fields. Hence, for symmetric coset spaces, 
the effective Lagrangian (10.16) can only give scattering amplitudes with an even 
number of particles. For theories with only even interaction vertices, the soft 
theorem (10.15) applied to an amplitude with an even number of particles yields 
Adler zero without any further requirements on the coefficients Kapc- 
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10.2 Geometric Framework for Scattering Amplitudes 


In the analysis in Sect. 10.1.1, the freedom to redefine the field variables 7“ was 
something of a nuisance. Indeed, we had to demonstrate explicitly that our soft 
theorem (10.15) is invariant under field reparameterization and thus physically 
meaningful. Here we will take control and make the freedom to redefine fields work 
for us. The discussion in this section loosely follows [8]. 

Suppose we are given a set of fields x° taking values from some manifold M. 
For the moment, we do not even have to assume that these are NG fields and M is 
a coset space, M ~ G/H. All we need is that the low-energy physics of the system 
is captured by the Lagrangian 


1 
L= 58b) uT Bn” : (10.23) 


In particular, this requires that there are no interactions without derivatives, and 
any higher-derivative operators can be treated as subleading. Under a point trans- 
formation, 7“ —> x(x), the symmetric matrix function gap(7) on M behaves 
as a rank-2 covariant tensor. Moreover, it must be positive-definite in some 
neighborhood of the origin, 7“ = 0, to give a kinetic term with the correct signature. 
It can therefore be interpreted as a Riemannian metric on the target manifold M. 

Perturbative interactions among the fields 7“ are generated by expanding the 
metric in powers of the fields. Individual interaction vertices are thus determined by 
derivatives of the metric at the origin. However, physical observables must at the 
same time be invariant under an arbitrary redefinition of the fields. We can use this 
fact and switch to the Riemannian (geodesic) normal coordinates on M, in which 
(see Appendix A.6.4) 


1 1 R 
Sab (0) = gab (0) — 37°" Racha (0) = gi T n° (VeR)achd (0) + O(r*) . 
(10.24) 


Moreover, we can always assume (and I will henceforth do so) that ggp(0) = dap 
so that the kinetic term in (10.23) is canonically normalized. This shows that 
any observable such as an on-shell scattering amplitude depends on gap(7) solely 
through the Riemann curvature tensor and its covariant derivatives at the origin. 
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To see how the curvature tensor enters the amplitudes, let me spell out explicitly 
the simplest cases of the 4-particle and 5-particle amplitude, 


Aa arazay (P1, P2, P3, p4) = i = —S24Raarazay = 534Ra,azanay > 


Aaiaazasas (P1; P2, P3, P4, Ps) = i 


= 845 Va3Rayayanas = $25 V a, Rajazazas 
E s35 Va Raiazaas = $34Va5Rayazaray 


E (s24 + s45)Ña;Raimazas . 
(10.25) 


Here I have introduced a compact notation for a scattering amplitude with parti- 
cles of energy-momentum p” and flavor a;. All the energy—momenta are (now 
and henceforth) oriented inwards. The variables s;; = (pj + pj)” generalize the 
standard Mandelstam variables to processes with an arbitrary number of particles. 
Finally, I have dropped the argument (0) of the curvature tensor for notation 
simplicity. 


The presentation of the amplitudes is highly ambiguous due to the redundan- 
cies among the Mandelstam variables and the components of the curvature 
tensor. To arrive at the expressions in (10.25), I have adopted the following 
conventions. First, the variables sı; with any i > 2 can be eliminated 
through sı; = — `D j2=25ji» which follows from overall energy-momentum 
conservation for massless particles. Second, one of the remaining Mandelstam 
variables can be eliminated by means of 0 = Dy = (p2 + p3+---)*. I use 
this to get rid of s23. Similarly, all components Rgjaja,a, Can be expressed 
solely in terms of those for which i < min(j, k, /) simultaneously with k < l 
and j < max(k,/). This is guaranteed by the symmetries of the curvature 
tensor and the algebraic Bianchi identity (see Appendix A.6.3). Finally, the 
differential Bianchi identity (A.99) can be utilized to remove Va Raja jaga 
where m < min(k, L). 


With the above remarks out of the way, let us have a look at the result. First, the 
4-particle amplitude vanishes in the limit where any of the energy-momenta p” is 
taken to zero. This boils down to the very special kinematics of relativistic 4-particle 
scattering processes. The 5-particle amplitude is more interesting. Our expression is 
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designed to make the soft limit for the last particle transparent,” 


lim Aa arasasas (P1, P2, P3, P4, Ps) = —524 Vas Rayayayay m 834 Vas Rajazaza4 
ps> 


= Vas Ania (pi, P2, P3, p4) . 
(10.26) 


This is a special case of a geometric soft theorem, 


lim Aaa, (Pls «+++ Pn) = Va, Aapan (Pls -+s Pn-1) (10.27) 
Pn>0 


valid for any scalar theory that does not contain nonderivative interactions. There 
may be other, higher-order derivative couplings additional to (10.23). The covariant 
derivative of the amplitude in (10.27) is taken in the target space M with respect to 
the Levi-Civita (LC) connection defined by the metric gap (7). 

The reader is referred to [8] for a general nonperturbative proof of (10.27). 
Here we are mostly interested in EFTs for NG bosons, defined on a coset space 
of spontaneously broken symmetry. For this class of theories, I will derive below a 
dedicated geometric soft theorem using the more elementary approach of Sect. 10.1. 


10.2.1 Geometric Soft Theorem for Nambu-Goldstone Bosons 
Inspired by the above discussion, we shall now reinterpret the results of Sect. 10.1.2 


in terms of the geometric properties of the coset space G/H. The Noether current 
for the spontaneously broken generator Q, can be written as 


OL ab b 
Jt = a T) = grem) EL (m) tr . (10.28) 


Expanding this in powers of the NG fields 2“ gives 
JË = goe (OEL (O)IMH* + da lger n? +O’). (10.29) 


I have already made the assumption that the kinetic term is canonically normal- 
ized, that is gap(0) = dap. Accordingly, the coupling of the current JË (x) to the 
one-particle state Ix? (p)) is defined by the matrix element (0| J% (0)|r?(p)) = 
—i Denk? (0). The bilinear part of the current (10.29) yields in turn the algebraic 
coefficient Kabe = ðe (gabé) (0) = ae (EL) (0). We only need the part of this, 


2 The amplitude is of course invariant under any simultaneous permutation of the energy-momenta 
and the flavor labels. This permutation invariance is just hidden in the economic expression (10.25). 


220 10 Scattering of Nambu-Goldstone Bosons 


antisymmetric under the exchange b <> c, which can be simplified to 
Kabe — Kacb = (Ve&4)60) — (V8 )e(0) = 2(Vc8%,)5(0) (10.30) 


using the Killing equation (A.92). Putting all the pieces together, we infer 
from (10.15) a geometric soft theorem for the amplitude Ag,...a,(P1,---. Pn) 
for scattering of n NG bosons with flavors a; and energy—momenta př, 


lim, E° (0) Alay --ay_;b(P1s -- -> Pn) 


Pn 
n-1 (10.31) 
= DOPE); (0) Alay --aj—,baj44dy—1 (pi, e.’ Pn-1) . 


i=1 


In [8], the same result was obtained from (10.27) by invoking the G-invariance of 
the scattering amplitude in the schematic form 


n—-1 
Lea Aara = BV Faas P Y Vaba) Aaii gbaaa = 0 x 
i=1 


(10.32) 


The formulation (10.31) of the soft theorem is manifestly covariant under field 
redefinitions that preserve the normalization of the kinetic term. One can take 
advantage of this and compute the algebraic coefficient (v? EPa; (0) in suitably 
chosen local coordinates on G/H. Specifically, in the exponential parameterization, 
we already have an expression for the Killing vector in (10.18). Likewise, the 
expression (10.17) for the MC form can be used to recover the metric ggp() and 
in turn the Christoffel symbols, needed to evaluate the covariant derivative of the 
Killing vector. Putting everything together, one finds 


A 1 
(VeE4)0(0) = 5 Gea fa — dba fac — Baa foe) » (10.33) 


in accord with (10.20). This confirms that the soft theorems (10.15) and (10.31), 
formulated respectively in terms of the algebraic and geometric properties of the 
coset space, are equivalent. 


10.2.2 Adler Zero or Not? 


We have already observed that if the coset space G/H is symmetric, the right-hand 
side of the soft theorem (10.15) or (10.31) automatically vanishes. Is the symmetry 
of G/H also a necessary condition for the Adler zero? We cannot exclude the 
possibility that the soft limit of a specific amplitude in a given theory vanishes due 
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to fine tuning of the effective couplings. However, in order that all amplitudes of the 
EFT vanish in the soft limit, the coefficients (10.33) must vanish for any choice of 
a, b, c. Using the shorthand notation fabe = bad fL. the vanishing of (10.33) is in 
turn equivalent to fabe = fbca + fcab. Applying this twice gives 


Jabe = Soca + Sab = (feab F Save) + (Jabe +F Soca) = 2 fabe + Joca + Feab . 
(10.34) 


This is only possible if fabe = 0. Thus, the set of all scattering amplitudes of 
the EFT satisfies the Adler zero principle if and only if the coset space G/H is 
symmetric. 

Incidentally, there is a neat geometric way to understand the one-way implica- 
tion, ensuring Adler zero for symmetric coset spaces. Namely, the latter have the 
property that all covariant derivatives of the Riemann curvature tensor identically 
vanish (Theorem 10.19 in [9]). The Adler zero can then be seen as a direct 
consequence of (10.27). On coset spaces that are not symmetric, the nontrivial soft 
limit of scattering amplitudes is governed by the structure constants fg. According 
to Sect. 7.4.1, these are in a one-to-one correspondence with the (frame components 
of the) torsion 2-form of the canonical connection on G/H. Thus, the nonvanishing 
soft limit can be said to arise geometrically from the torsion of the coset space. 


Example 10.3 


One of the simplest symmetry groups leading to a nonvanishing soft limit of some 
scattering amplitudes is the Heisenberg group H3. This is a three-dimensional 
nilpotent Lie group. One can choose a basis of its Lie algebra 3, Qa witha = 1,2 
and Q, so that the only nontrivial commutation relation among the generators is 


[Qa, Qb] = i£abQ . (10.35) 


Suppose the symmetry of a system under the Heisenberg group is completely 
broken. Denoting the NG fields associated with Q, and Q respectively as x“ and 6, 
the coset space H3/{e} ~ H3 can be parameterized by 


U(z,0) = exp( ir" 0) exp( 20 o) ; (10.36) 


where v is a positive dimensionful constant. The corresponding components of the 
MC form are w = (1/v)dz" and œ? = (1/v)dO + 1/(2v?”)eapn dr’. The most 
general two-derivative effective Lagrangian would now be given by a generic rank-2 
symmetric tensor built out of the MC form. For illustration, it is however sufficient 
to consider a particularly simple special case, 


1 1 1 2 
i= 5 Sab dy OFT” t3 (u0 F Sean un”) , (10.37) 
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This makes the calculation of various scattering amplitudes easy. Namely, the sole 
cubic interaction operator equals, up to a surface term, —1/(2v)ĝeapn“ On}. As 
a consequence, any Feynman diagram where a 6-propagator is attached to two 
external x-type legs will vanish on-shell. 

Any on-shell 3-particle amplitude in a derivatively coupled relativistic theory 
of massless scalars automatically vanishes. Hence we need to consider a process 
with at least five particles to have a hope for a nonzero soft limit. Leaving out 
straightforward details, a simple example is 


S12 
Agi222(P1, P2, P3, P4, Ps) = =z - (10.38) 


2v3 
In order to compare this to the prediction of our soft theorem (10.31), we have to 
keep in mind the factors of v in (10.36). These make the only nonzero structure 
constant effectively f°, = £ap/v. Together with £2(0) = 82, (10.31) and (10.33) 
then predict 


p 1 
lim A1222 (P1, -- -, Ps) = =—-[Ai122(p1,.--, pa) 
ps—>0 2v (10.39) 


— Ago22(pi,---, P4)] - 
This is easily verified by calculating the two 4-particle amplitudes and finding that 


3512 S12 


r} A , P2, P3, S 10.40 
Ay 9622(P1, P2, P3, P4) m2 ( ) 


Ayi22(P1, P2, P3, Pa) = 


By permutation invariance, the same kind of result applies to the soft limit in 
which either Ps or Ph is taken to zero. On the other hand, the soft limit of 
Agi222(pi,.--, ps) for pt or Ps manifestly vanishes, as is clear from (10.38). 

The cubic interaction operator in (10.37) can be removed by the field redefinition 
mo = pt + che” p’ with c = 1/(2v). This leads to the Lagrangian 


2 
LO = za +0707) o- 4 + xt — °$?) (8,0)? + AAOC) 
(oa ce? 2 
+ ELA x ðto + zi + 070°) x dub — cpd], 
(10.41) 


where I used the notation @ = (¢!, 7) and @ x Ono = Eabh Ong? to avoid 
proliferation of indices. The first term on the second line of (10.41) is equivalent 
to —(c?/3)0°@ x Og by partial integration, and thus does not contribute to tree- 
level 5-particle amplitudes of the theory. All such amplitudes are therefore encoded 
in a single operator, —3 0667 (H x 0,@). This shows that Agj222 and Ag2i11 are 
the only nonzero 5-particle amplitudes, and makes it easy to reproduce (10.38). 
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10.2.3 Symmetric Coset Spaces 


Expressing the scattering amplitudes in terms of the curvature tensor and its 
covariant derivatives is elegant and gives us useful insight. However, calculating 
the covariant derivatives of the curvature tensor from the metric g,p(z) in practice 
may be a tiresome task. The situation is much better for symmetric coset spaces 
where all the covariant derivatives of the curvature tensor vanish. Here we have 
a closed expression for the Riemannian metric in the normal coordinates (see 
Appendix A.6.4), 


Sab (7) = 8ac (0) 


sin2,/ R (sz) i O (—1)K22k+1 | 
ove = gac (0) > Chee 
k=0 


kyc 
(10.42) 


where Ê“ (r) = R° On", All we need to know are the constants R° „pg (0). 
These can be extracted from our discussion of the geometry of coset spaces in 
Sect. 7.4. First, however, a word of caution is in place. 

Throughout Appendix A, I dutifully distinguish components of tensors in a local 
frame from those in a local coordinate basis by using different types of indices. 
This has not been necessary in the main text of the book so far. Here, to avoid 
confusion, I will use lowercase indices a,b,... to indicate a local coordinate 
system, and underlined indices a, b,... for components in the frame defined by 
the MC form. Thus, for instance, a comparison of (10.16) and (10.23) shows that 
the G-invariant Riemannian metric on the coset space is g(7) = Kapat (7) @w2(r). 
The components of the metric in the MC form basis are constant, as shown explicitly 
in Sect. 8.1.2. 

According to Sect. 7.4.1, the same is true for the curvature 2-form of the LC 
connection on symmetric coset spaces, 


1 R 
RE O) = = 5 fap fedo (a) A O(a) . (10.43) 


Combining this with (10.42) gives upon a brief manipulation a practically more 
useful expression for the metric, 


c 
sin?,/ R(r) ; i - eee 
gT) = Kac = drt @ drt, RE, (0) = — fac Joan T , 


R(x) 


(10.44) 


where m4 = œ% (0)x^. Here w4(0) is a constant matrix that can easily be evaluated 
from the definition of the MC form, w1 (m) = @of(n)Qa = $r) Qadrã. 
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Combining (10.23) with (10.44) determines the complete two-derivative effective 
Lagrangian on symmetric coset spaces in a closed form. 

The structure constant fý, in (10.44) defines the adjoint action of the unbroken 
subgroup H on the broken generators, and thus also the NG fields. For compact 
and semisimple groups G, the other type of structure constant in (10.44), fpg, can 
be related to the former using the Cartan—Killing form on g; cf. Example 10.2. This 
leads to the intriguing conclusion that the effective Lagrangian (10.16) is completely 
fixed by the set of low-energy constants Kap, Spanning a symmetric invariant tensor 
of H, and certain linear representation of H. No specific information about the full 
symmetry group G is needed, as was first emphasized in [10]. 


Example 10.4 


The requirement that G be compact and semisimple is essential. Contrast the coset 
spaces SO(n + 1)/SO(n) ~ S” and ISO(n)/SO(n) ~ R”. Both of these are 
symmetric, share the same H, and in both the set of broken generators transforms 
as a vector of SO(n). Yet, the corresponding EFTs are different; SO(n + 1) is 
compact but ISO(7) is not. In case of SO(n + 1)/SO(n), the low-energy EFT is 
most easily presented as a nonlinear sigma model for a linearly-transforming unit 
vector field, n € S”. It implies nontrivial interactions among the n NG bosons. On 
the other hand, in case of ISO(n)/SO(x), the leading-order EFT (10.16) reduces to 
a noninteracting theory, Zett = (1/2)6ap0,.70° dx. This follows from the fact that 
the broken generators of ISO(n) commute with each other so that f, = 0. The 
curvature tensor is trivial. After all, ISO(n)/SO(n) is just the flat Euclidean space. 


10.3 Beyond Adler Zero 


Until now, we have been solely preoccupied with the question whether or not a given 
scattering amplitude vanishes in the soft limit for a chosen particle. This amounts 
to restricting the amplitude to a special hypersurface in the space of all allowed 
(on-shell and energy—momentum-conserving) kinematical configurations. There are 
however other interesting kinematical regimes that also probe the algebraic and 
geometric structure of the underlying EFT. One simple possibility is to take a 
consecutive soft limit for two particles. Here the geometric soft theorem (10.27) 
proves invaluable, giving immediately 


lim lim Abpea, (P1; +++s Pn) = Va, Van- Play -tp_o (Pls -- -> Pn—2) + 
Pn-1>0 pn>0 
(10.45) 
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Of particular interest is the commutator of the two limits, probing the extent to 
which the limits interfere with each other, 


lim , lim Aaa, (P1; -- -> Pn) 
Pn-1>0 Ppr>0 


n—2 (10.46) 


— b 
5 R aian 1an flay -aj-1bajg1 apo (pı, i Pn-2) ? 
i=l 


where I used (A.79). Another possibility is to take a simultaneous soft limit for 
two or even more particles. This may lead to a kinematic singularity even in the 
absence of cubic interaction vertices. See [8, 11] for a precise formulation of such a 
simultaneous double-soft theorem and further details. 

Here I will remain within the confines of the single soft limit, but look more 
closely at the asymptotic behavior of the scattering amplitude near the limit. I will 
promote the energy-momenta p” in an n-particle process to functions p” (z)ofa 
scaling parameter z € A such that A (1) = p” for all i and moreover pp (z) = zph. 
The other functions P; H(z) with i = 1,..., — 1 must be chosen so as to preserve 
overall energy-momentum conservalo, 3 pe (z) = 0, and remain on-shell, 
[pi(z)]? = 0, for any z € C. Also, it is required that p” (z) with i = 1,...,n — 1 
have a nonzero limit for z — 0, that is, only the n-th particle becomes soft for small 
Z: 

For a generic kinematical configuration, the complexified tree-level amplitude 
Aaa (Pi(z), ..-, Pn(z)) will presumably be analytic in z in some neighborhood 
of the origin. It can thus be expanded in a series with a nonzero radius of 
convergence, 


Aaya, (P), «+++ Pn—1(2)s ZPrn) = Sa On (10.47) 
k=0 


The soft theorems derived in Sects.10.1 and 10.2 determine the leading 
term, cO. “dn in terms of (n — 1)-particle amplitudes with energy—momenta 
Py (Os ae PEL ,(0). The asymptotic behavior of the amplitude at small z is 
dominated by the first term in (10.47) for which one -a, 1S nonzero. In the following, 
I will refer to the corresponding value of k as the soft scaling parameter and use the 
symbol o; 


Aga (P1), ii PEA), PSs Cl, FOR). (10.48) 


The notation used in (10.47) may suggest that the coefficients cf a are 


independent of the detailed choice of functions p;(z) beyond the basic 
requirements laid out above (10.47). I am not aware of any proof of this 


(continued) 
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implicit assumption, or even a study addressing the issue. Still, it is plausible 
to assume that at least the leading term in (10.48) is independent of such 
arbitrary choices. 


Clearly, o = 0 if the soft limit of the amplitude is nontrivial (nonvanishing). The 
Adler zero amounts to o > 1. Barring accidental cancellations, we expect a generic 
EFT satisfying the Adler zero principle to have o = 1. 


Example 10.5 


Recall that for n = 4, all the Mandelstam variables s, t, u vanish in the soft limit for 
any of the four particles. This automatically guarantees Adler zero for the 4-particle 
amplitude in any derivatively coupled EFT of massless scalars. Moreover, in EFTs 
of a single species of massless scalar, permutation invariance allows only one linear 
function of Mandelstam variables, s + t + u, which is identically zero. Hence, in 
EFTs of a single NG boson, the 4-particle amplitude automatically satisfies o > 2. 


In case o > 2 for all (tree-level) scattering amplitudes in a given EFT, we say 
that the soft limit of the amplitudes is enhanced. The rest of the present section is 
devoted to two immediate questions. Are there, in fact, any theories where scattering 
amplitudes feature an enhanced soft limit? If yes, is there any generic mechanism 
that makes the soft limit enhanced beyond the simple Adler zero? 


10.3.1 Dirac-Born-Infeld Theory 


The cleanest way to answer the first question is to find an explicit example. We 
will make an educated guess, loosely following [12]. Consider the class of EFTs 
of a single real NG field, m. We already know that o > 2 for the 4-particle 
amplitude. In fact, the constraint s + t + u = 0 leaves us with a single candidate, 
algebraically independent permutation-invariant 4-particle amplitude with o = 2, 
namely A(pı, p2, p3, p4) = (s? + + u?)/(4A pD). Here Ap is a nonzero 
constant of mass dimension D, the dimension of spacetime; the factor of 4 is a 
mere convention. This amplitude can be reproduced by a local quartic interaction 
operator, —[(8 1)? /(8Ap). This interaction in turn gives nontrivial amplitudes 
also for any even number of particles higher than four. 

The problem is that already the next, n = 6 amplitude fails to satisfy the 
desired scaling with ø = 2. By dimensional analysis, the n = 6 amplitude is a 
homogeneous function of degree 6 in energy—momenta. A detailed computation 
shows that the leading, c} term in its expansion (10.47) can be canceled by adding 
a new interaction operator carrying six derivatives, [(a,7)7 1° /(16A p). This pattern 
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extends inductively to all higher orders. For any n, the o = 2 scaling of the (tree- 
level) (2n)-particle amplitude can be rescued by adding an interaction operator 
[(Ouz7) and tuning its coefficient to a unique value. This ensures cancellation of 
the c contribution to the (2n)-particle amplitude, generated by exchange diagrams 
with lower-point interaction vertices. Altogether, we end up with a unique theory 
modulo the choice of A p; all the iteratively constructed interaction operators fold 
neatly into a closed-form Lagrangian, 


Zori = Apy 1+ (Oym)?/Ap . (10.49) 


This is the so-called Dirac—Born—Infeld (DBI) theory. 

A combination of (10.49) with the volume element, dx. / 1+ (Oy)? /A D, 
admits a remarkable geometric interpretation. This is the induced volume measure 
on a D-dimensional hypersurface (“brane”), embedded in a flat (D+ 1)-dimensional 
spacetime and parameterized by the Minkowski coordinates x“. The NG field x (x) 
arises from spontaneous breaking of the symmetry under translations in the extra 
dimension by the brane. Properly normalized, the displacement of the brane along 
the extra dimension is 2(x)/.//Ap|. The sign of Ap determines the signature of 
the metric in the extra dimension. For positive Ap, the spacetime Lorentz group 
SO(d, 1) is thus extended to SO(d, 2), whereas for negative Ap it is extended to 
SO(d + 1, 1). Up to an arbitrary choice of scale, we end up with two distinct DBI 
theories, differing by the geometry in the extra dimension. 

The fact that the DBI Lagrangian (10.49) does not contain any fields without 
derivatives automatically guarantees the Adler zero. The realization of the o = 2 
enhancement is however highly nontrivial. At first sight, it is not even obvious 
why the order-by-order cancellations required to make the soft limit enhanced 
should be possible at all. In Chap. 11, I will show that this is a consequence of 
the extended symmetry of the DBI theory. Let us therefore have a closer look at this 
symmetry. Under the translation in the extra dimension, the NG field shifts simply 
as (x) > 2’(x) = z (x)+ e€. What is more interesting are the rotations connecting 
the D physical dimensions to the extra dimension. These can be parameterized by a 
Lorentz vector €” and their infinitesimal form reads 


sgn Ap 
d(x) = y |A plex" , òx” = — 5 e"n (x). (10.50) 
Í vIAb| 


Note that this is not a spacetime symmetry by the definition given in Sect. 4.1, since 
òx” depends on the field x (x). Rather, (10.50) is an example of a generalized local 
transformation. Its evolutionary form is 


sen Ap 


vAn] 


which depends on both x(x) and its derivative. It is easy to verify that (10.49) is 
quasi-invariant under (10.51). 


m' (x) — w(x) = V|Aplenx + e"n (x)ðun (x), (10.51) 
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10.3.2 Galileon and Special Galileon Theory 


Operationally, the form of the DBI theory is so strongly constrained because its 
Lagrangian only depends on the first derivatives of the NG field. One might hope 
to find more examples of theories with scattering amplitudes enhanced beyond the 
plain Adler zero by allowing for higher derivatives. Indeed, one can realize o = 2 
trivially by setting 


1 
Let = 5 Ou)” + Zim) , (10.52) 


where nt is an arbitrary function of second derivatives of x(x) whose Taylor 
expansion in z starts at the fourth or higher order. This, while not very interesting 
per se, points in the right direction. Namely, the class of theories (10.52) is 
manifestly invariant under the so-called Galileon symmetry, 


d(x) =e +e ,x". (10.53) 


The interaction Lagrangian is strictly invariant, whereas the kinetic term is quasi- 
invariant. Mere invariance under the constant shift with scalar parameter €, together 
with the absence of cubic interaction vertices, already guarantees Adler zero. 
However, it is ultimately the full Galileon symmetry (10.53) that makes the soft 
limit of the scattering amplitudes enhanced; I will again return to this in Chap. 11. 
It is now natural to ask whether there might be any interactions that, similarly to the 
kinetic term, are merely quasi-invariant under (10.53). If present, such interactions 
would necessarily contain less than two derivatives per field, and thus realize a soft 
limit with ø = 2 in a nontrivial manner. Moreover, by power counting, they would 
dominate over the interactions of type nt(3 ðr) in (10.52). 

Finding all quasi-invariant Lagrangians requires an inspection of the relative Lie 
algebra cohomology of the spontaneously broken symmetry, as observed in a special 
case in Sect. 8.1. In case of the Galileon symmetry, a detailed analysis shows [13] 
that in D spacetime dimensions, there are D + 1 algebraically independent quasi- 
invariant Galileon Lagrangians, 


EA = Cee ee gag Oui Out) ++ (Ou, Ov, 70) 5 (10.54) 
where n = 0,..., D. The quasi-invariance of all BAe under the Galileon 


symmetry (10.53) is manifest. The first of these, a = (—1)?—' Diz, is a mere 
tadpole. This should be discarded in order for the EFT to be perturbatively well- 
defined. The next Galileon Lagrangian, BAe = (-1)?-!(D — 1)!zO7, is just the 
kinetic term up to a rescaling and integration by parts. Each of the interaction 
Lagrangians a with n > 2, or any linear combination thereof, realizes the o = 2 
enhanced soft limit of scattering amplitudes. One might be concerned about the 
presence of the cubic operator, EA This is however harmless. Namely, within the 
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class of Lagrangians 


D 
Faas, (10.55) 
n=0 


the couplings c, can be recalibrated by a one-parameter group of transformations, 
known as the Galileon duality [14,15]. This amounts to a field redefinition, and thus 
does not affect the S-matrix of the theory. It can always be used to set c2 to zero. 

The 4-particle amplitude in the Galileon theory (10.55) is a homogeneous 
function of degree 6 in the particle energy-momenta. Permutation invariance 
together with the constraint s + t + u = O implies that the amplitude equals 
s? +13 + u? up to overall normalization. Hence, in Galileon theories, the 4-particle 
amplitude is doubly enhanced, ø = 3. Is it possible to adjust the values of cn 
in (10.55) so as to maintain o = 3 for all multiparticle amplitudes? The answer 
to this question is positive, and the resulting one-parameter family of theories is 
dubbed special Galileon. 

The further enhancement of the soft limit of scattering amplitudes is associated 
with yet another, hidden symmetry [16]. Consider the infinitesimal transformation 


1 
Se (x) = eh” [r + garona) , (10.56) 
AD+2 


where Ap+2 is a (nonzero) constant with mass dimension D + 2 and é”” is 
a traceless symmetric tensor of parameters. This is another nontrivial exam- 
ple of a generalized local transformation. To check whether (10.56) leaves the 
Lagrangian (10.55) quasi-invariant, it is sufficient to inspect the variation of the shift 
of the corresponding action, ðe SGal, with respect to x. A direct calculation gives 


D 
8c Scat) SD Jafra 1X, + — (D—n)(D—n DX f 
Z D+2 


n—-1 
= eli HnÀn+1 ÀD gin . 
Xn =€ mon E Anqi-Ap £ Han ] [@u: ðv) ’ 
i=1 
(10.57) 
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a reader wishing to verify this might want to take advantage of the auxiliary identity 


n+1 


Hiv Un+2Àn+3 ÀD Y1 Pn +2 
E e Aàn43 Àp EHn+2Vn+2 u; dvi) 
i=l 


n+ 1 gli HnÀn+1 ÀD gV1Vn 


TE S TERY (10.58) 


n—l1 


x e”. Boece J [Onan — Ga dgz) Towa] . 


i=l i=l 


The quasi-invariance of the Lagrangian (10.55) is now equivalent to the vanishing 
of (10.57), which leads to a recurrence relation for the couplings cn, solved by 


on = 5 (2) ae as 7. ) “| (10.59) 


Ab42 2n +1 Mp4 D 2n+1/n+1 


What we have here are in principle two independent Ki of theories. However, 
the even set cz, inevitably includes the tadpole operator a that we have discarded 
on physical grounds. We are thus left only with the odd terms c,1, that is those 
Galileon operators (10.54) that are even in the NG field x. The parameter cı is 
fixed by demanding proper normalization of the kinetic term. The only genuine 
parameter of the special Galileon theory is therefore the scale A p+» that enters the 
symmetry transformation (10.56). Specifically in D = 4 spacetime dimensions, we 
have c3 = —c;/(2Ag6) as the sole effective coupling. Upon some integration by 
parts, the Lagrangian of the special Galileon theory can then be cast as 


Lai" ~ 5) - —— (ôu) [(Or)? — (y8,20)"] . (10.60) 


12A6 


See [17] for a recent overview of the many intriguing properties of the special 
Galileon theory. 

So far we have discovered three theories that realize in a nontrivial manner an 
enhanced soft limit of scattering amplitudes: DBI, Galileon and special Galileon. 
Remarkably, this is it as far as relativistic EFTs of a single NG boson are con- 
cerned [18]. Mapping the landscape of theories with o > 2 and multiple NG bosons 
remains, as of writing this book, an open problem. Below, I outline a symmetry- 
based approach that makes the identification of candidate multiflavor EFTs in 
principle straightforward. In Sect. 10.4, I will then introduce an entirely different 
method, which utilizes solely known infrared properties of on-shell amplitudes and 
general quantum-field-theoretic principles. This approach is very efficient in proving 
that other nontrivial soft behavior than that observed in the above three theories is 
not possible. 
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10.3.3 Effective Theories with Enhanced Soft Limit from Symmetry 


I have already hinted, so far without proof, that the enhanced soft limit of scattering 
amplitudes in the DBI and Galileon theories is a consequence of symmetry. 
The additional symmetry cannot be internal, for that would lead to mere linear 
constraints on scattering amplitudes or, even worse, additional NG bosons if 
spontaneously broken. Indeed, a glance at (10.50) and (10.53) shows that we are 
dealing with transformations with nontrivial dependence on spacetime coordinates, 
parameterized by a Lorentz vector. Guided by this observation, I will now show 
that the presence of such symmetries is severely constrained by group-theoretical 
consistency requirements. The discussion in this subsection is a special case of a 
framework developed in [19]. 

Consider an EFT for a single species of NG bosons. The symmetry of this EFT 
is generated by the operators of angular momentum J,,, and energy-momentum 
P,,, and a spontaneously broken scalar generator Q. The commutation relations 
between J,,, and P, are fixed by the Poincaré algebra. Furthermore, [J,», Q] = 0 
expresses the fact that Q is a scalar, whereas [P,, Q] = O is needed for Q to 
generate an internal symmetry. Let us now add a Lorentz-vector operator K,,, 
generating the tentative hidden symmetry responsible for enhancement of the soft 
limit of scattering amplitudes. Lorentz invariance restricts the commutation relations 
between K,, and the other generators to? 


[Jus Kils i(8&vaKu F Zua Kov) , [Pus Ky] = i(agpvQ T bJuv) , 


[Ky, Ky] = icJw , [Ky, Q] =i(dP,, +eK,) . 
(10.61) 


The parameters a, b, c,d, e cannot take arbitrary values, since to correspond to a 
well-defined Lie algebra, the commutators must satisfy a set of Jacobi identities. 
A straightforward calculation leads to the algebraically independent constraints 
ae = 0, b = 0 and c = —ad. Should [P,,, Ky] be nonzero, which is necessary for 
the symmetry generated by K,, to affect the momentum dependence of scattering 
amplitudes, a must be nonzero as well. Upon absorbing it into a redefinition of Q 
via Q — Q/a, the three commutators in (10.61) not fixed by Lorentz invariance 
reduce to 


[Pu Ky] =igyvQ >, [Ku Ky] = “WJ yy >, [Ky, Q] =ivP, ; 
(10.62) 


with the shorthand notation v = ad. 


3 Additional terms containing the LC tensor can be added to some of the commutators in D = 3 
or D = 4 spacetime dimensions. These however turn out to be irrelevant in the sense that they are 
either forbidden by Jacobi identities or can be absorbed into a redefinition of the generators. 
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The case of v = 0 describes the Lie algebra of Galileon transformations (10.53). 
Here Q generates the constant shift with parameter €, whereas K, generates the 
linear shift with parameter e€,,. It remains to address the case v # 0. Here the 
parameter can be nearly eliminated by rescaling both Q and K, by \/|v|. What is 
left of v is just its sign, entering through [K,,, Ky] = FiJ,v and [K,, Q] = iP}. 
Finally, merge K, with Ją» and Q with P, by identifying J,.p41 = K, and 


Pp+1 = Q together with gpD+1,Dp+1 = +1 and voila, we find the (D + 1)- 
dimensional Poincaré algebra. Its v = +1 version is based on the SO(d, 2) group of 
spacetime rotations, whereas the v = —1 version is based on SO(d + 1, 1). These 


correspond precisely to the two mutations of the DBI theory. 

We have successfully recovered the symmetries of the DBI and Galileon theories 
by a simple Lie-algebraic argument. The utility of the symmetry-based approach of 
course does not end here. It can be pursued further towards an explicit construction 
of effective actions, respecting the symmetry. This however requires a generalization 
of the techniques developed in Chaps. 7 and 8 to coordinate-dependent symmetries. 

The classification of candidate symmetries leading to enhancement of the soft 
limit of scattering amplitudes, outlined here, can be extended in various directions. 
First, we may want to see how the special Galileon theory fits in. As suggested 
by (10.56), this requires adding another set of generators that span a traceless 
symmetric tensor representation of the Lorentz group. While the derivation of all 
the Lie-algebraic constrains is now more laborious, one eventually recovers (10.56) 
along with (10.60) as the only physically relevant solution [19]. A similar reasoning 
was used in [20] to show that in D = 4 dimensions, there are no EFTs of a single 
NG boson that would realize o > 4 enhancement of the soft limit in a nontrivial 
manner. Finally, the Lie algebra (10.61) can be extended to allow for multiple flavors 
of NG bosons. This turns out to be an efficient tool for carving out the landscape 
of potentially interesting EFTs [21]. The DBI and Galileon theories have a natural 
multiflavor generalization. However, the precise relation between the symmetry and 
the asymptotic behavior of scattering amplitudes in the soft limit that would allow 
us to unambiguously predict the value of o, remains unknown. 


10.4 Soft Recursion 


No primer on scattering of NG bosons can be complete without at least briefly 
mentioning on-shell methods for scattering amplitudes, developed in the last 
decades. Given the context of this book, I will restrict the discussion to on-shell 
recursion relations for EFTs of NG bosons. These emerged as an adaptation of the 
framework, originally designed for the Yang—Mills theory by Britto, Cachazo, Feng 
and Witten. An interested reader will find an introduction to the latter in Chap. 3 
of [4]. 

We return to the broad class of EFTs for multiple NG boson flavors. For notation 
simplicity, I will however suppress flavor and momentum labels and denote the 
on-shell tree-level amplitude for an n-particle scattering process simply as An. 
Suppose now that we are able to complexify the energy—momenta in the process, 
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př — pi'(z), so that all the p” (z) are linear functions of z € C, satisfying 
VA (0) = p” . We of course still require that the pe (z) add up to zero and remain on- 
shell, [ P; (z)]? = 0, for any z € C. Then the scattering amplitude A, is complexified 
to a meromorphic function An (z) in the complex plane. Furthermore, for any 
holomorphic function F,(z) such that F,(0) = 1, the function An (z)/[zFn(z)] 
is also meromorphic and has a simple pole at z = 0 with residue An (0) = An. 
The residue theorem then implies that upon integration along an infinitesimal circle 
enclosing the origin, 


i x 
An aa An (z) 


- , 10.63 
miS RO aR 


By the same residue theorem, the integral can also be expressed in terms of residues 
at the other poles in the complex plane, possibly including the residue at infinity. 
This is the central idea behind on-shell recursion techniques. To progress further, 
we need to be more specific about the complexification of the energy-momentum 
variables. We also need input on the asymptotic behavior of the amplitude at z —> 
oo. Together, these will allow us to tailor the function Fn (z) to the EFT at hand. 


10.4.1 Complexified Kinematics 
Our requirements that pe (z) be linear in z and reduce to př at z = 0 are solved by 


Bi (2) = př + zq” (10.64) 


with arbitrary a However, the on-shell condition [f;(z)]> = 0 is most easily 
satisfied if që is parallel to př . I will parameterize such q! by a real constant c;,* 


By (z) = př (1 — ciz) . (10.65) 
This is called the all-line soft shift of the energy-momentum variables. The last 


requirement we have to take care of is overall energy-momentum conservation, 
which leads to 


n 
X cpt =0. (10.66) 
i=l 


I will always assume a generic kinematical configuration, disregarding accidental 
linear dependencies among the energy-momenta p” . This is equivalent to the 


4 Within this section, I always indicate summation over particles in the scattering process explicitly. 
Hence, no summation is implied by the repeated index “i” in (10.65). 
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assumption that p” as a D x n matrix has the maximum possible rank consistent 
with energy-momentum conservation, that is min(D, n — 1). By the rank—nullity 
theorem, the set of solutions to (10.66) then spans a vector space of dimension 
n — min(D,n — 1) > 1. The one-dimensional subspace that is always guaranteed 
to exist corresponds to cı = --- = Cy. However, it is desirable to have solutions for 
which all the c;s are different. This will allow us to probe the single soft limit for the 
individual particles by tuning z — 1/c;. The existence of such solutions requires 
that D < n — 1, or better n > D+2. 

The all-line soft shift (10.65) cannot access the single soft limit of all scattering 
amplitudes, and the problem gets worse with increasing the spacetime dimension 
D. This motivated [18] to introduce alternative prescriptions, combining (10.64) 
and (10.65) for disjoint subsets of the n particles. To illustrate the idea, I will briefly 
outline the minimal modification of (10.65), dubbed the all-but-one-line soft shift. 
This applies (10.65) to the first n — 1 particles but uses (10.64) for the last one, 


pi (z) = pi (1—cjz) fori =1,...,n—1, PË (z) = ph+zqh. (10.67) 


Energy—momentum conservation and the on-shell condition now dictate that 
n=1 
qn = do api, i, = Pn’ =O. (10.68) 
i=l 


The first relation in (10.68) can be viewed as a definition of qk. The second relation 
then constitutes two homogeneous constraints on C1, ...,Cn—1, one linear and one 
quadratic. This defines an (n — 3)-dimensional hypersurface in the R’~! space of 
all c;. The hypersurface always includes a one-dimensional linear subspace where 
c1 = +--+ = Cn—1, as guaranteed by energy-momentum conservation. The existence 
of other, nontrivial solutions for c; requires that n — 3 > 1, orn > 5, independently 
of the spacetime dimension D. We find that the all-but-one-line shift is applicable 
to a broader set of amplitudes than the all-line shift for any D > 4. The price to pay 
is that it only allows us to access the soft limit for n — 1 particles out of n. 


10.4.2 Recursion Relation for On-Shell Amplitudes 
We are now ready to deal with the complexified amplitudes. Consider an EFT 
with soft scaling parameter o. By definition, any complexified n-particle amplitude 


An (z) in this theory has a zero of order o at z = 1/c; for any of the particles to 
which (10.65) has been applied. It is then advantageous to set 


Frio (z) = [a = ciz)? , (10.69) 
i=l 
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where n; = n for the all-line shift and ns = n — 1 for the all-but-one-line shift. This 
choice of F, (z) is optimized for maximum suppression of the integrand in (10.63) 
at large z without introducing any new poles in it. This suppression is sufficient to 
eliminate the pole at infinity provided 


m<Nso , (10.70) 


where m is the degree of A, as a function of energy—momenta. 


The constraint (10.70) is satisfied for a large class of EFTs. First, any EFT of the 
type (10.16), defined on a symmetric coset space, has (m, 0) = (2, 1) for any n. This 
follows from the relation V = J + 1 between the numbers of interaction vertices 
V and internal propagators 7, valid for any connected tree-level Feynman diagram. 
Second, the Lagrangian of the DBI theory (10.49) contains one derivative on each 
field. As a result, the negative powers of energy-momentum due to propagators are 
exactly canceled by the adjacent interaction vertices. What is left is in effect one 
derivative per each external leg of the Feynman diagram, hence (m, o) = (n, 2) for 
the DBI theory. Finally, the special Galileon theory in D = 4 dimensions contains 
a single quartic interaction vertex with six derivatives. This gives m = 6V — 2/ and 
n = 4V — 21, hence (m, o) = (2n — 2, 3). 


With no pole at infinity and no new poles from F,,o (z), all the z 4 0 poles 
of the integrand in (10.63) come from the propagators inside An (z). Consider a 
partition of all the particles in the scattering process into two disjoint subsets, I 
and J. Among all diagrams contributing to the tree-level amplitude Ay(z), there 
is a subset of graphs where the particles in J and / can be separated by cutting a 
single propagator. This subset of diagrams defines a factorization channel, which 
I will label simply with the letter 7; see Fig. 10.3 for illustration. One can always 
assign the labels 7, Í so that the n-th particle belongs to I. Then c; is well-defined 
for all particles in J even when the all-but-one-line shift is used. With the shorthand 


Fig. 10.3 Example of a 
factorization channel in an 
8-particle scattering process. 
The two sides of the 
factorization channel 
correspond respectively to 

I = {1,5, 6, 7} and 

T = {2, 3, 4, 8}. The gray 
disks represent the sum of all 
possible Feynman diagrams 
with the given external legs 
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notation 


PY = Xpt, f= X a. (10.71) 


iel iel 


the complex energy-momentum in the propagator separating I and J is p” =g or : 
This leads to two poles at z = z7 where 


N 
~ 


Fa 1 
= al” -Qr +y Pr: Q- PO]. (10.72) 


Near these poles, the complex amplitude An (z) factorizes as 


x cont APAT) are; APAT) 
A, (z2) —> = ; 10.73 
(z) (Pi — zQ1)* (z— zp} )(z-z7) P? i i 


Here A; (z7 ) is the complexified on-shell amplitude for scattering of particles in /, 
and likewise for A j eP). We arrive at the key result that the n-particle amplitude 
An can be reconstructed from on-shell amplitudes with lower numbers of particles, 
as long as it can be complexified using one of the prescriptions introduced in 
Sect. 10.4.1. Importantly, this soft recursion applies to the entire amplitudes. This 
observation lies at the heart of an approach to symbolic computation of scattering 
amplitudes that avoids the combinatorial explosion of ordinary perturbation theory. 
In practice, one only needs to calculate explicitly a small number of “seed” 
amplitudes. All higher-point amplitudes can then be reconstructed recursively. 


In D = 4 spacetime dimensions, the all-line shift (10.65) can be used for any 
amplitude with n > 6. This leaves us with A4 and As as the seeds that need to 
be known a priori. For theories with just even interaction vertices, such as those 
from Example 10.6, one only needs to calculate the 4-particle amplitude explicitly. 
This is consistent with what we learned previously about the structure of EFTs on 
symmetric coset spaces. Namely, it follows from (10.42) that the quartic interaction 
vertex already fixes uniquely the entire effective Lagrangian. The same applies to 
the DBI theory. 


Let us now see the recursion program through to the end. Upon summation over 
all factorization channels, (10.63) becomes [22] 


A + 5 —ĝ F 
a-y 1 al 7 an Aj (<P) Ar (ZAG ) |. (10.74) 


1— z} /z;)Fao ZE) (z7 /z})Fno(z7) 
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The sum runs over all partitions of the particles, that is each pair Z, Í is only 
counted once. This is our master equation, which allows recursive construction 
of tree-level scattering amplitudes by symbolic computation. However, the first 
step of the recursion can often be performed manually. Namely, if the functions 
Ay (z), Ar) themselves do not contain any factorization poles, the summand 
in (10.74) can be interpreted in terms of the residues of the meromorphic function 
Ar DA (2) /[z Fao (Z)(PI — z0]. This function does have also poles at z = 
1/cj, since unlike An (z) itself, Ay (z) and A 7(Z) are not on-shell and thus do not 
necessarily vanish at these points. We can then use the residue theorem to rewrite 
the sum over poles at z7 in terms of a sum over poles at z = 0 and z = 1/c;, 


A OA; (0) ADAC) | 
Res y 10.75 
a= S IER Pe oi eer = 2072 ane) 


The first term takes into account diagrammatic contributions to A, with an internal 
propagator. The second term must therefore match the contributions to A, from 
contact n-point operators in the effective Lagrangian. 


The special Galileon theory (10.60) in D = 4 dimensions has a single, quartic 
interaction vertex. When applied to the 6-particle amplitude, we therefore expect 
the second term in (10.75) to vanish. To check this, rewrite the Lagrangian as 


3 
1 
Loa = T Oun? - me ] [@u:%,7) - (10.76) 
i=l 


The Feynman rule for the interaction vertex is [—i/(6^6)]G(p1, p2, p3), where 
py : PS ; Ps are any three of the four four-momenta in the vertex and G denotes 
the corresponding Gram determinant. Since the latter is a homogeneous function of 
all its arguments of degree two, we have 


6 


Ar @A;(2) = G((pilieN GAP} ei) IIc — C42)" (10.77) 


ee (6A6)? 


for any partition of the six particles into two triples 7, I. I have implicitly used the 
all-line shift. Note that the criterion (10.70) is satisfied even if we take F,, 5 (z) with 
o = 2 instead of o = 3. The advantage of this choice is that the denominator term 
Fa,o (z) is then completely canceled by the last factor in (10.77). This indeed makes 
the second term in (10.75) disappear since the residues at z = 1/c; trivially vanish. 
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The derivation of (10.74) does not apply to theories with a nonvanishing soft 
limit, where o = 0 and (10.70) cannot hold. Here we have two alternatives: either 
use Fa o(z) = 1 and deal with the residue at infinity, or use Fa, (z) with o > 
1 and deal with the ensuing poles at z = 1/c;. The latter option is feasible, at 
least for two-derivative EFTs of the type (10.16), where sufficient suppression of 
the integrand in (10.63) is ensured by F;,,1(z). The residues at the simple poles at 
z = 1/c; can then be extracted from the geometric soft theorem (10.27) or its more 
explicit version (10.31). A minor modification of the steps leading to (10.74) now 
gives [8] 


1 APAP) "Va; An-1(/ci) 
Ar = —— at . (10.78) 
3 P? 2 A= zr/z DEn e say [etl ae) 


Here I used the shorthand notation An 1 for the amplitude where the i-th particle 
with flavor index a; has been removed. It is of course still possible to rewrite the 
first term in (10.78) as (10.75). 


10.4.3 Soft Bootstrap 


The modern scattering amplitude program strives to construct amplitudes, or even 
define what a quantum field theory is, without recourse to a Lagrangian or the 
symmetry thereof. It is therefore of great interest to understand what kind of soft 
behavior of scattering amplitudes is allowed on general grounds. Recursion relations 
such as (10.74) or its generalization (10.78) provide a versatile tool for constraining 
the landscape of possible EFTs. 

In [18], soft recursion was used to derive a set of bounds on the soft scaling 
parameter o. The most striking result is that in D > 4 spacetime dimensions, o = 3 
is the maximum value that can be realized in a nontrivial manner. (Any ø can be 
realized trivially by interactions with at least o derivatives per field.) Apart from 
this universal bound, the maximum achievable value of o in an EFT with a fixed set 
of interaction operators is also constrained by the number of derivatives per field. 
Consider a contact operator of the schematic type (d)*"~*"~?, and characterize 
the number of derivatives per field by the parameter 9 = (m — 2)/(n — 2). We thus 
have @ = 1 for the DBI theory and 9 = 2 for all the Galileon interactions (10.54). 
Then, the soft scaling parameter is bounded by o < ọ + 1. This shows that the DBI 
and special Galileon theories are “exceptional” in the sense that they maximize o in 
their respective classes of theories with fixed 9. Similar bounds can be derived for 
translationally and rotationally invariant nonrelativistic EFTs [1]. 

The values of ø and ọ alone do not necessarily specify a unique EFT. Here 
one can gain further insight by a procedure known as the soft bootstrap, see for 
instance [23,24]. In D < 4 dimensions, the only seed amplitudes needed to apply 
soft recursion in combination with the all-line shift are A4 and As. These are 
contact amplitudes without any exchange contributions. As such, they are given by 
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polynomials of degree m in the particle energy-momenta, or m /2 in the Mandelstam 
variables. These polynomials are restricted by energy-momentum conservation and 
permutation invariance, and their complete classification is usually straightforward. 
With a list of candidate seed amplitudes at hand, one then proceeds to construct 
higher-point amplitudes via (10.74) or (10.78). The result of the recursion must be 
independent of the choice of parameters c; satisfying (10.66) or (10.68). Failure to 
pass this cj-independence test indicates that the seed amplitudes do not correspond 
to any well-defined local field theory. 


Soft bootstrap cannot, strictly speaking, be used to prove that a theory based 
on a fixed set of seed amplitudes exists. That would require checking the 
cj-independence of recursively constructed amplitudes to all orders of the 
recursion. In this aspect, soft bootstrap is nicely complemented by more 
conventional EFT approaches based on symmetry. The latter are efficient in 
isolating a set of candidate EFTs with desired particle composition and soft 
behavior, as illustrated in Sect. 10.3.3. What soft bootstrap can do is show that 
other theories with given soft behavior than those known a priori do not exist. 
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Part IV 


Spontaneously Broken Spacetime Symmetry 


A 
| | 
Check for 
updates | 


Our discussion of effective field theory (EFT) for Nambu—Goldstone (NG) bosons 
has so far been limited to internal symmetries. The application of the EFT program 
to symmetries that depend on or affect spacetime coordinates is associated with 
numerous subtleties. The first of these lies in the identification of the degrees of 
freedom of the low-energy EFT. We saw already in Chap. 6 that the fluctuations of 
the order parameter induced by independent broken symmetry generators may be 
related to each other. Very often, this is a consequence of a redundancy between the 
localized actions of different symmetry transformations. This chapter explores such 
local equivalence of coordinate-dependent symmetries as a preparation for the more 
detailed discussion of broken spacetime symmetries in the following chapters. 

The first two sections follow the pedagogical exposition in [1]. In Sect. 11.1, 
I show that local equivalence of symmetries leads to a linear relation between 
the corresponding Noether currents. This lifts the intuitive classical picture of 
redundancy among NG modes, outlined in Chap. 6, to the language of quantum 
field theory. In Sect. 11.2, I then reinterpret the local equivalence of symmetries in 
terms of their simultaneous gauging. The whole Sect. 11.3 is reserved for several 
physically relevant examples, illustrating the general theory of locally equivalent 
symmetries. Finally, in Sect. 11.4, I return to scattering of NG bosons of internal 
symmetries. I show how the relations between Noether currents derived in this 
chapter help explain the enhancement of the soft limit of scattering amplitudes in 
the Dirac—Born—Infeld (DBI) and Galileon theories. 


11.1 Relations Between Noether Currents 


Consider a theory of a set of (not necessarily scalar) fields y’, and suppose that 
its action is invariant under two different classes of continuous symmetries. As we 
know from Chap. 4, the associated Noether currents can be extracted by evaluating 
the variation of the action under the corresponding localized transformations. The 
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evolutionary form of these transformations can be written generally as 


Sy E) = efa Fiy, x10) + Ip OK TW, x1) ++, 


(11.1) 
Soy (x) = G AFAN, xE) + Ores (x) Kgl TW, x1) +o 

As before, the square bracket notation indicates local functions of the fields and 
their derivatives, possibly also depending explicitly on the spacetime coordinates. 
Furthermore, er (x) and ey (x) are the localized parameters of the transformations. 
The presence of derivatives of the parameters in (11.1) hints that I allow for a 
generic localization of the transformations; the ellipses represent terms with higher 
derivatives of e (x) and ey (x). The original symmetries are recovered by reducing 
the coordinate-dependent parameters to constants, a) > a and es! (x) > a I 
will call the two symmetries locally equivalent if there is a set of smooth functions 
fe (x) such that setting 


ef (x) = fit (es (x) (11.2) 


makes the two transformations in (11.1) identical, 8, yÝ (x) = Sow! (x). 
Locally equivalent symmetry transformations should give the same variation of 
the action as expressed in terms of the corresponding Noether currents, 


ôs = | as JE lw, 41) dyer (x) = fo JK Iy, A) one x) 
(11.3) 
= / dx FHL, KOOL At apes (x) + 12) Oy fal E] - 


However, the latter expression does not automatically vanish for constant ey as it 
should. In order to ensure this, we have to impose the integrability condition 


JE lv, xl fe = ANE, x] , (11.4) 


where Ny [y, x] is some local vector function of the fields and their derivatives. 
Combining (11.4) with (11.3) and integrating by parts in the latter then leads to the 
Noether current equivalence relation 


Jol, x1 = fot JEL, x] NYI, x]. (11.5) 


This is the main result of the chapter that deserves a few comments. First, 
an alternative approach to local equivalence is to start with constant er, that is 
the actual symmetry with generators Q2y. One then demands the existence of 
smooth functions de (x) such that this symmetry is reproduced by a localized 
transformation of the type 5;w! with parameter ef (x) = tx (xz. This is the 


origin of the integrability condition (11.4). If needed, the localized transformation 
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ŝ24Ż can then be defined as òi yi with the parameter ef(x) as given in (11.2). It 
is therefore reasonable to think of 8,1! as a parent symmetry and of 871’ as its 
descendant. 

Second, I have not imposed the equation of motion. Thus, both the equivalence 
relation (11.5) and the integrability condition (11.4) must hold off-shell. By taking 
a divergence of (11.5) and using (11.4), we obtain another off-shell relation, 


Ou Jou lv, x] = | a ôu JH IY, x]. (11.6) 


This makes the Ward identities for the correlation functions of Ji a consequence 
of those for JË 4: By the same token, the two sets of currents couple to the same 
one-particle NG states. This explains in a quantum-field-theoretic language the 
redundancy of NG modes generated by locally equivalent symmetries. 

Third, should fa be constant, the integrability condition (11.4) is fulfilled for 
N = 0. The equivalence relation (11.5) is then strictly linear in the currents. The 
corresponding generators Q14 and Qo2y are also linearly related, Q% = fo A. 
This special case therefore amounts to a mere change of basis of symmetry 
generators, or a choice of a subset thereof, and is of little interest. A nontrivial 
current equivalence relation only arises from a set of non-constant functions ix (x). 
Without loss of generality, one can drop the constant part of ix (x) and assume 
that i, (0) = 0. This assumption is satisfied for all the examples worked out below. 
It also guarantees a posteriori that the relation (11.2) cannot be inverted. In other 
words, the distinction between a parent symmetry and its descendant is well-defined 
and cannot be reversed. 

Finally, recall that the Noether current is determined by the symmetry transfor- 
mation only up to addition of a vector field whose divergence vanishes off-shell. 
Suppose that we modify the parent Noether current by J y Azad in at és y a Such that 
OMN i A = 0. Then the integrability condition (11.4) can still be satisfied if we shift 
Ny appropriately, Nyx > Ny + SESI 3 a- It follows that the ambiguity of the parent 
current does not affect the descendant current Ji as given by (11.5). The descen- 
dant current of course suffers from its own ambiguity under Ii > T + S Jho 
such that dS Jho = 0 off-shell. The ambiguities of the two currents are therefore 
independent of each other and do not constrain the validity of the equivalence 
relation (11.5). 


11.2 Noether Currents from Background Gauging 


The derivation of the Noether current of a given symmetry, based on (4.7), treats 
the localized symmetry transformation as a mere technical trick. There is however 
an alternative approach to Noether’s theorem that takes the local transformation 
seriously and promotes it to an actual symmetry of the action. This offers additional 
insight into the structure of Noether currents, the price to pay being a technical 
assumption that has to be verified case by case. 
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This key assumption is that it is possible to make the action of the theory 
exactly invariant under the localized symmetry transformation by adding a set of 
background gauge fields Ai. More concretely, we want to promote the original 
action S{y} to a functional Sty, A} such that Sty, A} | ac = = S{w}, where the 
subscript “vac” indicates removing the background. I allow for a generic local 
transformation of yi with parameters e* (x), following (11.1), 


Swi (x) = eA) FH, xE) + ap OK y, xx) ++ (11.7) 


As for the local transformation of the gauge field, we can assume a similar generic 
expansion in derivatives of the parameters €^ (x), 


SAM (x) = P (x) FR LA, x](x) + P (x) KBrlA, x](x) +--+ (11.8) 


the ellipsis again stands for terms with higher derivatives of €^ (x). It is important 
that §A4 does not depend on the dynamical fields y’ so that At can be treated as a 
fixed external background. 

The variation of action under a simultaneous transformation of yi and Aj 
consists of contributions from varying the former and the latter, 8S = y + 848, 
where 


> 8S . P 
ôa = f aP m a L (11.9) 
u 


Next we remove the background. In this limit, SyS rai by construction into 
f dx J NG A In order that 8S vanishes at least when €4 is constant, there must be 
a local function R4 ALW, x] such that 


òS{y, A} = 


pe [A, x](x) 
Gy 


= ð R4 IY, x](x) . (11.10) 


vac 


With this consistency condition satisfied, the vanishing of 8S for any €4(x) leads to 
the identification 


8S{W, A} = 


Jih, x(x) = RAL, x1) — SAB (a) 


KPA, o| ees. D 


vac 


Equations (11.10) and (11.11) already constitute a fairly general tool for derivation 
of Noether currents. Yet, the approach based on background gauge invariance can be 
further extended to higher-rank tensor background fields. Instead of cluttering the 
formalism with additional indices, I will content myself with a couple of illustrative 
examples, worked out in Sects. 11.3.2 and 11.3.3. 
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The notation used in (11.10) and (11.11) is tailored to what I called the 
parent symmetry. What if the system possesses an additional, descendant set of 
symmetries? By (11.2), the localized transformations of the descendant type form 
a subset of all localized transformations of the parent type. All we therefore have 
to do is to gauge the parent symmetry by adding its background gauge fields Ai. 
That we can in fact do so is the sole technical assumption we have to make. The 
descendant symmetry is then automatically gauged as well without the need for any 
additional gauge fields. This once again underlines the redundancy of the descendant 
symmetry. In particular, the background gauging makes it possible to recover the 
equivalence relation (11.5) for Noether currents; see [1] for a detailed proof. 


11.3 Examples 


I will now illustrate the general theory developed above with several examples. 
These are chosen mostly for their relevance to EFTs discussed elsewhere in the 
book. I will therefore not shy away from showing certain amount of details. 


11.3.1 Galileon Symmetry 


Consider a class of theories of a single real relativistic scalar field @, invariant under 
the Galileon symmetry 


S(x) =e tet, . (11.12) 


This can be localized by replacing the constant parameters with arbitrary functions, 
€1 — €1(x) and eS > e} (x). The shift by €; is the parent symmetry. The 
descendant symmetry with parameter e} can be recovered locally by setting 
€1(xX) = e} (x)x,. This is the local equivalence condition (11.2) where fy (x) = Xp- 
The integrability condition (11.4) and the current equivalence relation (11.5) then 
translate to 


Jilo, x] = NETO, x], ILO, x1) = xy JIo, x(x) — NBTG, xŒ). 
(11.13) 


Interestingly, the set of functions N} [@, x] is in this case the primary object, which 
by means of (11.13) determines both currents. 
Let us see how this works explicitly. Take the class of Lagrangians of the type 


D 
L = Y cn LG + Zm) . (11.14) 
n=0 
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Here int is an arbitrary function of second derivatives of œ, which is manifestly 
invariant under (11.12). Furthermore, ae with n = 0,..., D is the set of quasi- 
invariant Galileon Lagrangians, cf. Sect. 10.3.2, 


ge) _ erinto Mm, bh Bu Bi) ++ (Ou, 3n P) « (11.15) 


It is easy to extract the current J i by evaluating the variation of the action under the 
shift d(x) > (x) + €1(x). With the shorthand notation for the symmetric tensor 


D n 


x" =- Ya + A cee nAn REO i | [0n ðv), (11.16) 
n=0 i=2 


the current takes the compact form 


dZint 


maa i 11.17 
TT) i 


JË = X" ad — a 


This matches the integrability condition in (11.13) since X#” obviously has 
vanishing divergence and so X¥"0,@ = ə (X””¢). It follows that 


dZint 


NË = XH go 
0(0,0"d) 


(11.18) 


and the current J is then given by the second relation in (11.13). This result can 

be easily verified by evaluating the variation of the action under d(x) > (x) + 
H 

€ (X)Xu. 


The simplest example of a theory from the class (11.14) is the theory of a free 
massless scalar, Y = (1/2)(3„ġ)?. This amounts to cı = (—1)?/[2(D — 1)!] and 
discarding all the other operators contributing to (11.14). By (11.16), this leads to 
XY = 2c1g”” (—1)P (D — 1)! = g””. We then find in turn 


JË =$, NY = 846, Jy, = xð" h — ôo . (11.19) 


To illustrate the background gauging formalism, we need to gauge the parent 
symmetry. This is most easily done by first integrating the quasi-invariant Galileon 
terms by parts so that each factor of @ carries at least one derivative. All one then 
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has to do is to replace every ð p with Dyo = ð p — Apu. This gives the gauged 
action 


D 
Šio, A} = = > Cpl HnÀn+1 AD g1 Vn 
n=0 


(11.20) 


Àn+1 ÀD 


x f Pr Du PC) b T]9,, Py 0G) f APs AOD) 


i=2 


This is manifestly invariant under the simultaneous gauge transformation 5¢(x) = 
€1(x) and 8A, (x) = ð €11 (x). The latter corresponds to (11.8) with Fix = 0 and 
K 1 u= 67,5 no terms with higher derivatives of €; (x) are present. The consistency 
condition (11.10) is trivially satisfied with R” = 0, and (11.11) then gives 


8S{b, A} 


Ji'1¢, x(x) = — TEE 
u 


(11.21) 


vac 


It takes little effort to convince oneself that this exactly reproduces (11.17). 
To extract the descendant current J$, from the same gauged action, we set 
€1(x) = e} (x)x,, which changes the transformation rule for A,, to 


SAy(X) = E24 (x) + xvðp E (x) . (11.22) 


This matches (11.8) with Foz» = guv and K}, n = ôk Xa, and (11.10) then becomes 


Sio, A} 


3 REIG, x] (x) = Suv SA 
B vac 


= —Jiv[d, x]@) . (11.23) 


With the help of the first relation in (11.13) this is seen to be solved by RË = -N}. 
The master Eq. (11.11) then gives JA, in agreement with the current equivalence 
relation, that is the second identity in (11.13). 


11.3.2 Spacetime Translations and Rotations 


In relativistic theories with only scalar fields, spacetime translations and rotations 
can be implemented through their action on Minkowski coordinates, 


xH > xh el tel xy. (11.24) 
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i vector ey parameterizes infinitesimal translations and the antisymmetric tensor 
eS ” infinitesimal spacetime rotations. The evolutionary form (11.1) of the transfor- 
mations induced on the fields yy! is obtained by setting, respectively, ! 


Siyi set F Hawi, spi= ae "FL = eb x, uy! . (11.25) 


Translations and rotations are locally equivalent via (11.2); the latter can be 
recovered from the former by setting 


ef (x) = a te Coy MAC = 5% xp — Ôg Xa ; (11.26) 


The translations are the parent symmetry and the rotations its descendant. The 
Noether current corresponding to the parent symmetry is the canonical energy— 
momentum (EM) tensor T",, defined by 


8S = J dx TH iy, x] (x) OnE) (x) « (11.27) 


The left-hand side of the integrability condition (11.4) then becomes 
T” On fag = Tga — Tag - (11.28) 


It is known that in purely scalar theories, the canonical EM tensor is symmetric; 
I will justify this below using the background gauging approach. The integrability 
condition is then satisfied by Ne ap = = 0. The equivalence relation (11.5) in turn gives 
the Noether current for spacetime rotations, that is the angular momentum tensor, 


M” 


ap = Sop Ty = xpT "u — XaTs - (11.29) 


B 


This is just the familiar relation between momentum and angular momentum. 


The equivalence relation (11.29) does not apply to theories of fields with 
nonzero spin without further qualification. Namely, in such theories, space- 
time translations and rotations are not automatically locally equivalent. 
The reason for this is that the components of fields with spin undergo a 
transformation under spacetime rotations, which cannot be reproduced by the 


(continued) 


' The factor 1/2 in 8)! is conventional and compensates for the fact that the components a 


and a “ with fixed a + 8 parameterize the same transformation. The derivation of the angular 
momentum tensor M am g below assumes that the same factor 1/2 is used in its definition via (4.7). 
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naive local translation, 8; Y’ (x) = —ef (x) ap Y (x), for any choice of e} (x). 
The problem can be circumvented by a judicious choice of the localized 
transformation under translations. Instead of developing a general theory, let 
me illustrate the idea on a simple example. 


Recall Example 4.3 where I introduced the following theory of a real scalar field 
and a vector field A,,, 


1 
L = ANd — SAM Ay. (11.30) 


We found that the canonical EM tensor of this theory is not symmetric. However, 
we could solve the problem by considering a modified local translation, 


dipa) = —e xap ax), 81 Ap (xX) = eE (*)OVAp(X) — Av (x) dpe; () . 
(11.31) 


This is motivated by the fact that under a general local coordinate transformation, 
A, should transform as a covariant vector. Noether’s theorem then gives the 
improved, off-shell-symmetric EM tensor 


T” = gZ + AŁ A” — (AM ad + A’a"®) : (11.32) 


The prescription (11.31) restores local equivalence of spacetime translations 
and rotations in spite of the presence of the vector field. Namely, inserting 
el (x) = (1/2) fie (a)es” with constant e3% in (11.31) gives 


ef a=)” xy 
81 Ay (x) ———> E xg Dy An) + Erm A”). (11.33) 


This is exactly how A, should transform under a spacetime rotation. We can then 
localize both translations and rotations by using (11.31) augmented with (11.26). 
Since TH” is symmetric, the integrability condition is now satisfied and we 
reproduce the angular momentum tensor by replacing (11.29) with 


Mg = xpT", — Xa T"; . (11.34) 


Let us now address the relation between spacetime translations and rotations 
within the background gauging formalism. We start by gauging the parent symme- 
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try: spacetime translations. This amounts to promoting the flat Minkowski spacetime 
to a possibly curved spacetime manifold. Here it proves useful to invoke the 
differential-geometric language developed in Appendix A, to which I refer the 
reader for the basic terminology. Thus, the geometry of the spacetime manifold can 
be characterized by a local coframe (basis of differential 1-forms), e*“ = erty ? 
Under an infinitesimal coordinate transformation, x“ —> x” + et (x), the coordinate 
components of the coframe change as 


Ser (x) = —e] (x) Over (x) — &*(x)d,E] (x) . (11.35) 
This matches (11.8) provided we identify F# , = —dye% and Kf”, = —d/e%". A 


gauge-invariant action S{w, e*} is obtained from S{w} by replacing derivatives of 
yw! therein with covariant derivatives, and the flat Minkowski metric &uv With the 
spacetime metric, guy (x) = gapiz eE). An appropriate volume element is 
built using the determinant of the coframe ere (x) as a matrix. 

Eventually, we want to remove the background and go back to the flat Minkowski 
spacetime. I assume that the latter is described by global coordinates x“ in which 
the components ee are constant. Then the consistency condition (11.10) is solved 
by RË = 0, and (11.11) reduces to 


TY Ly, x(x) = efx) eae | : (11.36) 
bej) vac 


Suppose now that the theory at hand can be coupled to the background geometry 
in a way that the action S{y, e*} depends on a only through the spacetime metric, 
guv. This is an example of a higher-rank tensor background hinted at above. Then 


the variation of the action (11.9) is replaced with 


F 8S 
8,5 = pars 5z (~€? ds guv — ZavðuE? — Buadvet) . (11.37) 
uv 


On a flat Minkowski background, this automatically vanishes for constant et thanks 
to ASuv hac = 0. The EM tensor is then identified as 


SS{y, g} 


uv _ 
T” [w, x](x) = 2 Si) 7 


(11.38) 


This is the Hilbert EM tensor. Should our theory be Lorentz-invariant and only 
contain scalar fields, it can always be coupled to the spacetime background using 
the metric g,,,. At the same time, the scalar fields retain the transformation rule 


? Following the notation introduced in Sect. 10.2.3, I use underlined indices to indicate components 
within a local (co)frame. Ordinary Lorentz indices refer to a set of local spacetime coordinates, x”. 
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yia) = —e} Œ) yi (x). This guarantees that the canonical and Hilbert EM 
tensors coincide, and shows that the canonical EM tensor is necessarily symmetric. 


Theories of fields with spin can be coupled to background geometry in 
different manners, depending on how the spin index of the fields is treated. 
Let me illustrate this in the case of a vector field A,,. One already mentioned 
possibility is to treat this as a covariant vector that transforms under a general 
coordinate transformation via the second relation in (11.31). Itis then possible 
to construct a generally covariant action S{A, g} that only depends on the 
background through the metric. The resulting EM tensor (11.38) is symmetric. 

On the other hand, we may project the field on the local frame e% and 
treat Ay = e} A, as a set of spacetime scalars. This corresponds to keeping 
the naive transformation under local translations, Ag (x) = —e} (x)0, Ag (x). 
In this case, the gauged action S{A, e*} may depend explicitly on the local 
frame. We then have to use (11.36) instead of (11.38). This gives the canonical 
EM tensor which is not necessarily symmetric. 

The general moral is that different choices of gauging the symmetry may 
lead to different expressions for the Noether current. This is tantamount 
to different choices of the localized symmetry transformation (11.7). The 
resulting alternative Noether currents however only differ by terms that vanish 
on-shell, modulo the inevitable ambiguity with respect to contributions whose 
divergence vanishes off-shell. 


With the EM tensor at hand, we can extract the Noether current for any 
descendant symmetry that can be locally reproduced by spacetime translations (see 
e.g. Sect. 14.3 of [2]). Consider a one-parameter group of symmetries, generated by 
a vector field (x). The corresponding local transformation with parameter €2(x) 
can be written as x” —> x” + ef (x) with ef (x) = €2(x)E“(x). This turns (11.35) 
to 


der” = Ee dye, + eta”) "er Oe , (11.39) 
whence we extract Fg = —(E”3yer* + ef*0,&") and KE = =p Eee, The con- 


sistency condition (11.10) then boils down to 
j 1 
ð R” = -T",d,€" = AP Ouéy + ðv) - (11.40) 


In the last step, I assumed that T#” is symmetric, which we now know to be 
guaranteed in case the action S{w, e*} only depends on the coframe through 
the metric. But should &(x) actually generate a symmetry of the flat Minkowski 
spacetime, ð pv + 0)&,, must vanish by the Killing equation (A.92). Then (11.10) is 
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satisfied with R” = 0 and (11.11) gives immediately 
J iy, x(x) = T” iy, x] 0i) . (11.41) 


This confirms, among others, a general relation between the Hilbert EM tensor and 
the angular momentum tensor, anticipated in (11.34). 


11.3.3 Galilei Invariance 


The previous two examples were relativistic in spirit. Let us therefore have a look 
at one more example that is intrinsically nonrelativistic. Consider a theory of a 
complex Schrödinger field w(x, t) whose excitations are nonrelativistic particles 
with mass m. Suppose that the theory is invariant under spatial translations 
and under the internal U(1) group of phase transformations of the field. These 
symmetries act respectively on the spatial coordinates, x’ = x + e, and the field, 
Y’ = ew, and so correspond to 


òy (x,t) =iap(x,t) -—e-Vw(x,t). (11.42) 


Finally, it is well-known from elementary quantum mechanics that the Schrödinger 
equation for a free particle is invariant under Galilei boosts, 


w(x’, t) = Y (x + vt, t) = exp lim (» -x+ 5) w(x, t). (11.43) 


Here v is the boost velocity which plays the role of the transformation parameter. Let 
us assume that the invariance under (11.43) is inherited by our possibly interacting 
nonrelativistic field theory. 

The evolutionary form of an infinitesimal Galilei boost is 


sow (x,t) =imv-xw(x,t)—tuv-Vw(x,t). (11.44) 


This can be locally recovered as a combination of a spatial translation and a phase 
transformation if we set 


a(x,t) =mx-v(x,t), €(x,t) =tv(x,t). (11.45) 
In this case, therefore, the combination of spatial translations and internal U(1) is 
the parent symmetry, whereas the Galilei boosts are its descendants. The rest follows 
a familiar pattern. The integrability condition (11.4) requires that there is a local 


function N“"[y, x, t] of the field and its derivatives such that 


mJ” +T” = d,N" , (11.46) 
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where J“ is the current of the internal U(1) symmetry. The general equivalence 
relation (11.5) then gives the current B“” corresponding to Galilei boosts, 


BET = mx" JË 44TH" — NEY (11.47) 


Should N”” happen to be zero, the integrability condition (11.46) would boil 
down to T° = —mJ". Except for the sign convention in the definition of the two 
currents, this is the usual nonrelativistic relation between momentum and velocity. 
Moreover, (11.47) would imply B°” = mx’ J? + tT” = m(x"J®° — tJ"). This 
generalizes to field theory the well-known fact that the conserved charge for a Galilei 
boost of a particle of mass m is mx — pt, where p is the particle momentum. Whether 
or not, or under what conditions, N“” actually is zero needs to be inspected case by 
case. 

The story of Galilei invariance becomes very interesting, and nontrivial, once 
we consider background gauging. What we need is to gauge simultaneously spatial 
translations, Galilei boosts and the internal U(1) phase transformations. Since the 
Galilei boosts are descendant, we want to primarily gauge spatial translations and 
the internal U(1). The actions of these cannot be trivially separated as one could do 
in a Lorentz-invariant field theory. Namely, the gauged translations must affect the 
phase of the Schrödinger field in a way that reproduces (11.43). 

It turns out that the minimal setup that does the job includes a background 
spatial metric g,s(x) and a gauge field A,,(x) for the U(1) symmetry [3]. Under 
a combination of an infinitesimal local translation, x’ = x + €(x,t), and a U(1) 
transformation with parameter a(x, t), the background fields vary by 


8grs(x) = —e" (x) du grs (x) = Sus (x)d-E" (x) = Sru(x)dsE" (x) ; 
SAg(x) = doa (x) — € (x)ðu Ao (x) — Au (x) dpe" x) , 


SA, (x) = 0-a(x) — €" (x)ðu Ar (x) — Ay(x)0,-€"(x) — Mgrs (x)dE* (x) . 
(11.48) 


The first two lines take the expected form. The only nontrivial ingredient really is 
the last term on the third line of (11.48), which depends on the mass parameter m. 
This reflects the fact that the realization of spatial translations and Galilei boosts 
on a Schrödinger field is centrally extended, with m playing the role of the central 
charge. The associated nonrelativistic spacetime geometry is known as Newton— 
Cartan. See [4,5] for more details on nonrelativistic field theory in Newton—Cartan 
spacetimes, and [6] for broader background on nonrelativistic geometry. 

The localized transformation of the Schrédinger field itself is given by (11.42). 
By combining this with (11.48), we then generate at once the Noether currents of 
all the symmetries in terms of a gauged action Sty, g, A}. First, the current of the 
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U(1) symmetry is given by a trivial modification of (11.21), 


SS{W, g, A} 


SALG) , (11.49) 


JP Ty, x]œ) = — 


where “vac” now indicates setting g,, to the flat Euclidean metric 4,; and A, to 
zero. Next comes the EM tensor. By (11.27), the only components that the invariance 
under (time-dependent) spatial coordinate transformations gives us access to are 


SS{w, g, A} 
8A,(x) 


SS{w, g, A} 


Or _ 
T” Iy, x](x) =m grs (x) 


» Tiy, x](x) = 2 


vac 


vac 
(11.50) 


Note that (11.49) and (11.50) together imply a relation between momentum 
density and the particle number current, T°’ = —mJ". This corresponds 
to (11.46) with vanishing N“”. It might come as a surprise that the background 
gauge invariance yields a stronger constraint on the Noether currents than the 
original, physical symmetry. After all, background gauge invariance per se is 
merely a handy tool to encode the consequences of the symmetry. However, 
recall that the realization of Galilei transformations on the Schrödinger field is 
centrally extended. This is an obstruction that makes gauging of the symmetry 
nontrivial. It is ultimately the assumption that the gauging is possible that is 
responsible for the vanishing of N””. This does not add any new physics, 
but rather constrains the class of theories to which the background gauging 
procedure can be applied. 


Next, the current B/” for Galilei boosts can be identified by using (11.9) together 
with ôy S = fax BY Oyu". Upon inserting (11.45) in (11.48), we find that the 
consistency condition (11.10) is satisfied with RE = 0. This leads via (11.11) to 


or L r 88{¥, 8, A} 8S{v, g, A} 
B” ly, x](x) = —mx 3Ao(a) . + mt 34-0) . ; 
. vac . vac (11.51) 
rs = s SS{y, 8, A} dS{w, g, A} 
B’[w, x](x) = —mx 5A, (x) + 2t ET Ge) . 


These can be combined using (11.49) and (11.50) into BY” = mx” J” +tT*", which 
is just (11.47) without the N“” term. This is consistent with our conclusion above 
that N“” = 0 as a consequence of the assumption that the gauged action Siy, g, A} 
is invariant under (11.48). 
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11.3.4 Changing the Background: Magnetic Translations 


Our discussion in this chapter started with declaring the existence of certain 
symmetry and its subsequent localization. However, it is also possible to adopt 
the opposite approach. Suppose we know a priori the type of background gauge 
fields and the local transformations that make the action S{w, A} invariant. It is then 
possible to identify the original symmetry as the subset of the gauge transformations 
that preserve a trivial background. 


Recall the Galileon symmetry of Sect. 11.3.1. Out of all the gauge transformations 
Ay (x) > Ay (x) + 0,€(x), only those with constant e preserve the background, in 
this case regardless of the specific choice of A,,(x). Constant € corresponds exactly 
to the shift symmetry, acting on the Galileon scalar field ¢. 


In case of relativistic theories of fields with integer spin, we can consider as the 
background a generic spacetime metric g,)(x). The set of local general coordinate 
transformations that preserve such a background corresponds to (the continuous 
part of) its isometry group (see Appendix A.6.2 for the necessary mathematical 
background). In case of the flat Minkowski spacetime, this is just the Poincaré 
group, as follows from a trivial modification of Example A.15. Note that choosing 
the background geometry is not equivalent to fixing the coframe er as (11.36) 
might naively suggest. The reason is that the coframe is not uniquely determined for 
a given spacetime manifold. This underlines the necessity to carefully identify the 
geometric data, tailored to the given spacetime symmetry and uniquely representing 
the spacetime background. 


Once we have the gauged action S{w, A}, we may use it to study physics on 
nontrivial backgrounds. Indeed, the vanishing of the variation of 5 {w, A} under a 
localized symmetry transformation is an exact statement valid for any choice of 
A^., Imposing the equation of motion on the dynamical fields yê amounts to setting 
ôy = 0, which turns (11.9) into a generalized “conservation law,” 


èS pa ` 

D B ÑA B&A 

ese Ffu + Ove Kg, +) =0. (11.52) 
i on-shell 

For this to represent an actual divergence-type conservation law, the chosen 
background must possess some symmetry. This may however be different from the 
symmetry that we originally gauged by introducing the fields At in the first place. 
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I will illustrate this on an interesting example starting from Galilei invariance in 
d = 3 spatial dimensions. 

The flat nonrelativistic spacetime corresponds to g;5 = 6;s and A, = 0. The 
only infinitesimal transformations of type (11.48) that preserve this background are 
a linear combination of Euclidean translations and rotations, Galilei boosts, and the 
internal U(1) transformations. This is not obvious but is straightforward to check. 

Let us now change the background by introducing a uniform magnetic field B. 
Such a uniform background field should preserve some notion of translation 
invariance. This however cannot be the naive Euclidean translations due to the 
coordinate-dependence of the vector potential A(x) of the magnetic field. The 
details turn out to depend on the choice of gauge; I will adopt the symmetric 
gauge, A(x) = (B x x)/2. The translation invariance can now be rescued if it 
is accompanied by a local U(1) transformation. In terms of the Schrédinger field 
w(x, t), this so-called magnetic translation symmetry takes the form 


w(x’, t) = Y (x +e, t) = exp E “(x x B| y(x, t). (11.53) 


One remarkable consequence of this twisted translation symmetry is that the 
components of its generator P, no longer commute with each other. It is easy to 
check that they satisfy the commutation relation [P,, Ps] = —ié;s,B“Q, where Q 
is the generator of the U(1) symmetry. 

The magnetic translation symmetry (11.53) can be localized by promoting € to a 
function of spacetime coordinates and setting 


wr’ (x + €(x), t) = exp[—ie(x) - A(x) ] w(x, t), (11.54) 


where A(x) is now the spatial part of the variable A,,(x). At the infinitesimal level, 
this is equivalent to (11.42) with a(x) = —e(x)-A(x). Following the same reasoning 
as in Sect. 11.3.3, we then get modified relations for the EM tensor, 


T” Iy, x10) = —m JEW, x1) = 2A) O 


Siy, g, A} (11.55) 


grs (x) 


T™[w, x](x) = 2 2A*(x) JW, x](x)| 


vac ’ 
vac 


which generalize (11.50) to a uniform magnetic background. Here “vac” refers to 
the background vector potential, A, (x) le =(1 /2)8 Ersu Bx". The components 
of J” [y, x] are still given by (11.49). 

The magnetic translations illustrate in a very nontrivial manner many of the 
concepts introduced in Chap. 4 and this chapter. On the magnetic background, 
the Lagrangian of the Schrödinger field depends explicitly on the spacetime 
coordinates, yet possesses a nontrivial notion of translation symmetry. The Lie 
algebra of infinitesimal translations is centrally extended. The translation symmetry 
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can be gauged simultaneously with the internal U(1) symmetry by introducing a 
spatial metric and allowing the background fields to transform according to (11.48). 
This allows one to identify a nontrivial relation between the EM tensor and the 
Noether current of the U(1) symmetry, shown in (11.55). 


11.4 Application to Scattering of Nambu-Goldstone Bosons 


I will now return to the promise I made in Sect. 10.3.° Therein, I suggested that the 
enhanced soft limit of scattering amplitudes of NG bosons in the DBI and Galileon 
theories arises from the presence of extended symmetry. We now have all we need 
to be able to understand why. The material of this section loosely follows [7]. 


11.4.1 Galileon Theory 


Let us start with the simpler Galileon theory and recall the discussion of the 
Galileon symmetry in Sect. 11.3.1. In any translationally-invariant theory endowed 
with the Galileon symmetry, the parent current J E [ġ] will not depend explicitly on 
the coordinates. By the integrability condition in (11.13), the same holds for the 
local function N{'[¢]. Following the philosophy of Sect. 10.1, we now consider an 
arbitrary scattering process a — f. Using the translation property JË [o] x) = 
eiP x Jo] (Oe iP * and analogously N/[@](x) = e” * N Ip] (Oe iP, we 
deduce 


(BIJO la) = —ipp (BINS Td] la) , (11.56) 


where p = ph — ph. 

The general rules of polology dictate that when p” approaches the mass shell 
of a massless particle, the matrix element (6|J1,[@](O)|~) develops a pole. This 
corresponds to the emission of a NG boson with momentum p and allows one to 
extract the on-shell scattering amplitude Ay_, g42(p); cf. (10.1). The same reasoning 
can however also be applied to the operator N/'[@], leading to 


1 
(BINS IIO læ) = (ONF IAIO) l7 (P)) ott pepr) + Roa (P), (11.57) 


where Ray p) is by definition free of poles. By combining this with (11.56), we 


find a relation between Reavy (P) and the function Rea (p) in (10.1), 


Rpav(P) = ~iPpR gay (P) - (11.58) 


3 A reader interested primarily in the development of the general theory for spontaneously broken 
spacetime symmetry may proceed directly to the next chapter. 
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From (10.2) we then finally get an explicit expression for the on-shell amplitude, 


1 
Aa—>p+r(p) = p PeP” Rpa (P) ; (11.59) 


where F is determined by (0| J# [10x (p)) = iponF, and pbn = (|p| , p). 

The general relation (10.2) allows one to assert the existence of Adler zero, 
provided the function Rb x (p) remains regular in the soft limit p” — 0. A sufficient 
condition to ensure this is, as we know from Chap. 10, the absence of bilinear 
operators in the current t [¢]. This automatically translates into the absence 
of bilinear operators in Nb [@]. We then conclude from (11.59) without further 
assumptions that in the soft limit, the scattering amplitudes in the Galileon theory 
vanish at least with the second power of momentum. In other words, the soft limit 
of scattering amplitudes in the Galileon theory is enhanced with the soft scaling 
parameter o = 2. 


11.4.2 Theories with Generalized Shift Symmetry 


Having warmed up on the Galileon theory, we can now generalize the above 
argument to a much broader class of theories. For simplicity of notation, I will 
restrict the discussion to Lorentz-invariant theories of a single NG field @ that are 
invariant under the constant shift p — ¢ + € [7]. Further generalization to theories 
invariant under spacetime translations but mere spatial rotations can be found in [8]. 
A generalization to theories of multiple flavors of NG bosons presumably exists but, 
as far as I know, has not been worked out in the literature. 

Suppose that in addition to the constant shift symmetry, our theory is also 
invariant under a generalized shift of the type 


spa) = ef f(x) + Pig, x]@)} . (11.60) 


Here f(x) is assumed to be a polynomial function of the spacetime coordinates 
of degree deg f = n > 1. The local operator ®[¢, x] may depend explicitly on 
the coordinates as well, but is required to be polynomial in x” of degree lower 
than n. Examples of symmetries of this type include both the symmetry of the DBI 
theory (10.51) and the special Galileon symmetry (10.56). 

Let us now localize (11.60) and treat it as a generalized coordinate-dependent 
shift of the field, that is ò (x) = €[@,x](x) with čl, x] (x) = e(){f() + 
®[¢, x](x)}. The corresponding variation of the action is 


8S = f dPx JEE eLa) + O19, x1@)] 
(11.61) 


+ [f (x) + O16, x1) Jane} , 
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where if [ġ] is the parent current due to the constant shift symmetry. An integra- 
bility condition required so that (11.60) really is a symmetry of the action reads 


JT LPM) Ou Lf) + OLD, xw] = ILNI, x1) - (11.62) 


Unlike the parent current J i. [¢ġ], the local function N“[@, x] may depend explicitly 
on the coordinates. For (11.62) to hold, however, the polynomial degree of N“[@, x] 
in x” can be at most that of 0, f (x), that is n — 1. 

The next step is to rewrite (11.62) in the form 


IT (P1ef = Old, x13 J 1G] — 3u (Pld, xII [ol] — N“1d, x1) 


(11.63) 

= Off, x]ð J [p] + 3 M" Ip, x]. 
The new local function M"“[¢, x] of the field and its derivatives is again of 
polynomial degree at most n — 1 in the spacetime coordinates. It satisfies the 
translation property M“[@, x](x) = e'? * M“[, x](0)e~'”*, with the fields on the 
right-hand side evaluated at the origin but the explicit coordinate dependence left 
intact. Applying the polology rules to this operator then gives, in analogy to (11.57), 


(B|M"{[p, x1(O)|a) = (0|M“TG, x10) | P) Alaa tip) 
p ‘ (11.64) 


+ Rupa p). 


The coordinate dependence of the remainder function Riiga (x, p) comes entirely 
from that of M” [¢, x](0). Therefore, Ryg (X, p) is a polynomial function of the 
coordinates with degree at most n — 1. 

The crucial observation is that due to current conservation, the on-shell matrix 
element of ®[¢, x]o,J 4 [ġ] between any initial and final states, |æ) and |), 
vanishes. By combining (11.63) and (11.64), we then get a constraint on the 
remainder functions, generalizing (11.58), 


Rpa (Pu fŒ) = i Pu Ruga © P) + Op R Mga P) - (11.65) 


To understand the implications of this identity, we expand Rg, (Pp) and Re Ba (x, p) 
in a formal power series in p”. Let us denote their coefficients of order k in p“ 
respectively as ax and bx(x), and write (11.65) symbolically as 


ay: Of (x) = bg-ı (x) + 3 - Dg (x). (11.66) 


This must hold as an equality between polynomial functions of the coordinates. We 
now use repeatedly the fact that deg bg (x) < n — 1 for all k. Setting k = 0, we infer 
that ag = 0 and simultaneously 0 - bo(x) = 0, which implies that bo (x) is constant, 
deg bo(x) = 0. In case n > 2, we can next take (11.66) with k = 1. This can only 
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hold if a; = O and simultaneously 0 - bı (x) ~ —bo(x), that is, deg bj (x) < 1. By 
induction, we find that for any k < n—1, ag = O and simultaneously deg by (x) < k. 

The main conclusion is that for a generalized shift symmetry of the type (11.60) 
where deg f(x) = n, the series expansion of Rea (p) starts at order n in energy— 
momentum. By (10.2), the soft limit of scattering amplitudes of the theory is then 
enhanced with the soft degree o = n + 1. This confirms in particular that o = 2 for 
the DBI theory and o = 3 for the special Galileon theory. 
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A 
| Check for | 


updates 


In Part III of the book, I assumed that whatever the full symmetry of the system, 
only internal symmetry is spontaneously broken. Moreover, I only considered two 
possibilities for the symmetry of the underlying spacetime. For nonrelativistic 
systems, I assumed invariance under spacetime translations and spatial rotations; 
this symmetry is sometimes called Aristotelian. In case of relativistic systems, 
the Aristotelian symmetry is turned into Poincaré symmetry by adding invariance 
under Lorentz boosts. Both of these spacetime symmetries are easy to implement 
provided they are not spontaneously broken. Invariance under spacetime translations 
is ensured by demanding that the Lagrangian density of the theory does not 
depend explicitly on spacetime coordinates. Invariance under spatial or spacetime 
rotations can be guaranteed by contracting indices appropriately using ordinary 
tensor calculus. 

The situation becomes subtle when the spacetime symmetry is spontaneously 
broken, or when a different spacetime symmetry than Poincaré or Aristotelian 
is present. It is known that there are in fact multiple mathematically consistent 
kinematical groups including spacetime translations, spatial rotations and boosts [1]. 
Here I will consider in particular the Galilei symmetry, relevant for effective field 
theories (EFTs) of many nonrelativistic systems. Due to the mathematical structure 
of the Galilei group, the theory of its representations, and thus construction of 
Galilei-invariant actions, is highly nontrivial. For an overview of other possible non- 
Lorentzian kinematics and the associated spacetime geometry, see [2]. 

The EFT for broken spacetime symmetries finds natural application in cosmol- 
ogy. To make the methodology developed below useful also there, I will make an 
exception and initially allow for theories defined on a generic spacetime manifold. 
Some simplifying assumptions on the spacetime manifold will be necessary, but will 
only be introduced when needed. Otherwise, the general philosophy of construction 
of EFTs for spontaneously broken spacetime symmetry is the same as in Part III. I 
will start in this chapter by classifying possible nonlinear realizations of spacetime 
symmetry. In Chaps. 13 and 14, I will then utilize the results to build effective 
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actions. Unfortunately, there are no general explicit expressions for effective 
Lagrangians akin to those for internal symmetries, worked out in Chap. 8. I will 
therefore have to resort to outlining the basic algorithm and working out some 
illustrative examples. 


12.1 Reminder of Nonlinear Realization of Internal Symmetry 


The material of this chapter closely parallels Chap. 7, which the reader is advised 
to remind themselves of before proceeding further. In order to underline the basic 
logic, let me however give at least a brief summary of the steps we took in Chap. 7. 

We used the fact that internal symmetries in the sense of Sect.4.1.1 are point 
transformations that only act on fields and leave spacetime coordinates intact. This 
allows for a geometric reformulation of the problem of classifying all nonlinear 
realizations of internal symmetry. Namely, if the fields take values from a manifold 
M, then realizing the internal symmetry group G on them amounts to defining an 
action of G on M. While the action of the symmetry group is assumed to exist 
globally on the whole manifold M, its explicit expression requires a set of local 
coordinates. Thus, all statements depending on a specific choice of coordinates are 
without further qualification valid only in a local coordinate patch. 

We start by picking a point Yọ € M.! The subgroup Hy, of elements of G that 
map Wo to itself is called its isotropy group. Whenever Hy, is compact, there are 
local coordinates (7, x£) on M such that wo corresponds to 7“ = x}? = 0, and 
Hy, acts separately and linearly on 2“ and x}. Moreover, the set of points (“, 0) 
spans a submanifold of M, equivalent to the coset space G/Hy,. This makes it 
possible to encode zr“ in a unique representative element U (zr) of the corresponding 
left coset of Hy, in G. The representative element can be chosen so that the point 
wo is mapped to U(O) = e. In addition, the linear representation of the isotropy 
group Hy on sr“ is realized by the adjoint action, U (x) > hU(t)h7!,he Ay. 
Altogether, the action of G on M in the standard coordinates (x°, x°) is fixed by 
the choice of representative U (zr) and of the linear representation of Hy, on x°. 

In physics terms, every point Wọ € M defines a specific value of the order 
parameter for spontaneous symmetry breaking (SSB). The isotropy group Hy, 
corresponds to the unbroken subgroup of G. Once treated as maps from the 
spacetime to M, 2“ become the Nambu—Goldstone (NG) fields implied by SSB, 
whereas x°? are usually called matter fields. The same symmetry-breaking pattern 
G — H may be realized by different order parameters, and thus on different 
target manifolds M. This however only affects the number and type of matter 
fields present. The NG fields are identified unambiguously as coordinates on the 
coset space G/H, and are in a one-to-one correspondence with the generators of 
broken symmetry. 


1 In Chap. 7, I used the symbol xo instead of yo, which I will now reserve for a reference point on 
the spacetime manifold. 
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Moving now to spacetime symmetries, we will be able to follow the mathematical 
construction of nonlinear realization of internal symmetry very closely. We will only 
have to introduce one important modification, taking into account the distinction 
between fields and coordinates. However, the mapping to the physical concepts of 
SSB and NG bosons will turn out to be much more intricate. 


12.2 Spacetime Symmetry as a Point Transformation 


Consider a theory of a set of bosonic fields yŻ living on a D-dimensional spacetime 
manifold M. It is temporarily convenient to treat y’ and the spacetime coordinates 
x” on equal footing as independent variables spanning the product manifold M x 
M. This allows us to view the action of a spacetime symmetry (in the sense of 
Sect. 4.1.1) as a special case of a point transformation on M x M. See Fig. 12.1 for 
a visualization. Formally, the action of a spacetime symmetry group G on M x M 
constitutes a set of maps Tg, or equivalently functions F! and X#, such that 


Tg : (Wi, x") > ("x") = (F, x, g), Xt (x, 8)), gg eG. (12.1) 


The functions FÌ and X# cannot be chosen arbitrarily, but have to respect the 
structure of G. First, the unit element e € G must be realized by an identity 
transformation, 


Te=id & Fify, x,e)=yi, X"(x,e)=x". (12.2) 
y 
g 
pya- 7 F(X) 


| g F(W1,X0,g)3 


Xo X(Xo,g)3 xX 


Fig. 12.1 Visualization of the action of a spacetime symmetry on the product manifold M x M. 
The transformation of the spacetime coordinates x“ is independent of the fields y’. For fixed xo € 
M and g € G, the symmetry thus acts as a map between the slices M x {xo} and M x {X(x0, g)}, 
indicated by the dashed lines 
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Second, the group multiplication law dictates that for any g1, g2 € G, 


T, 


F (W, x, g1g2) = F'(F(W, x, 92), K(x, 82), 81) , 
gig 7 Tg o Tos 


KX" (x, 8182) = X" (X(x, 82), 81) - 


(12.3) 
Finally, for any g € G the map T,-ı must be the inverse of T,, 
7 F (F(Y, x, 8), Xx, g),8 =v", 
T,-1 = (Tg) 4 (12.4) 
XEK, g), 8 ) =x". 


The special case of (12.1) where y" = Fi (y, g) and X" (x, g) = x” corresponds 
to an internal symmetry.” Another special case is a purely spacetime transformation, 
where Fi (Y, x, g) = y’ for all y € Mand x € M. It follows from the conditions 
on the functions X*# in (12.2)-(12.4) that purely spacetime transformations form a 
subgroup of the symmetry group G. Here I will make a technical assumption that 
will prove essential for setting up standard coordinates on the product manifold 
M x M. Namely, I will assume that the subgroup of G of purely spacetime 
transformations acts transitively on every slice {Ww} x M with fixed y € M. Loosely 
speaking, this requires that the spacetime manifold M has sufficient symmetry that 
turns it into a homogeneous space. Now choose an arbitrary x9 € M and keep it 
fixed. By our assumption, for any x € M there is a purely spacetime transformation 
T xx € G that maps xo to x. Typically, there will be multiple such transformations; 
we then have to choose one 7,,-,, by convention. For flat spacetimes, it is natural 
to use a translation connecting the two points. However, we do not have to be that 
specific at this stage, keeping in mind that M may be curved. 

Eventually, we would like to treat yŻ as functions on the spacetime. Mathemati- 
cally, this amounts to replacing the action of the symmetry group G on M x M with 
one on maps M — M. Such an induced action is easy to write down thanks to the 
assumption that the transformation of the coordinates is independent of the fields. 
Thus, the fields y’ (x) are mapped to y” “x, g) such that 


wv" (K(x, g), g) = FW), x, g), or equivalently 


: , (12.5) 
w(x, 8) = F RE, g7), K(x, 87’), 8), 


for any g € G. We will need (12.5) once we want to construct a G-invariant effective 
action for the fields. For the time being, however, I will pursue the analogy with 
Chap. 7 and focus on the action of G on the product manifold M x M. 


? Strictly speaking, my previous definition of a spacetime symmetry included the assumption that 
X! (x, g) A x", at least for some x € M. This assumption is however immaterial in the present 
context. It will actually be convenient to think of the formalism developed here as a generalization 
rather than a modification of that in Part III of the book. 
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12.3 Standard Nonlinear Realization 


For every point (Y, x) € M x M there is an isotropy group consisting of elements 
of G that map the point to itself, 


Awy.x) = {h € G|Th(w, x) = (Y, x)} . (12.6) 


The setup of nonlinear realization of internal symmetry as reviewed in Sect. 12.1 
would naively suggest that we now choose a point (Yo, xo) € M x M. In its 
neighborhood, we could then establish the standard nonlinear realization of G 
in the usual manner. There are however two problems with this naive approach. 
First, we eventually want to treat fields as functions, defined globally on the whole 
spacetime M, as long as global coordinates on M exist. Second, the fields should be 
functions of the original, physical coordinates x”, not of some new variables whose 
dependence on y and x“ is beyond our control. 

In order to reach these goals, we take an intermediate step, introducing an 
isotropy group of a chosen spacetime point, 


Hy = {g € G |X" (æ, g) =x"). (12.7) 


Obviously, Hy,x) is a subgroup of Hy for any y e M. Moreover, having fixed 
a reference point x9 € M, we can use the maps Txọ—x to show that, in analogy 
with (7.7), isotropy groups at different spacetime points are related by conjugation, 


Ay = yes a Ay,x) = fee: ae : (12.8) 


xox? 


Example 12.1 


Let y be a complex Schrödinger field so that M ~ C, and let M be the flat Galilei 
spacetime. The action (11.43) of a Galilei boost with velocity v corresponds to 


(y, x,t) > (explim(v- x + v°t/2)]}y, x + vt, t) . (12.9) 


We assume that in addition to boosts, the symmetry group G also includes spacetime 
translations, spatial rotations, and the internal U(1) symmetry of phase transforma- 
tions. For x = (Xo, fo) = (0, 0), we find Hyo ~ [SO(d) x R¢] x U(1), consisting of 
spatial rotations, Galilei boosts, and the phase transformations. In the special case of 
Wo = 0, Hayo,xo) ~ Hxo. Otherwise, for any Wo Æ 0, Hiyo, xo) X SO(d) x R? C Hgy 
The isotropy groups for other spacetime points are obtained by conjugation (12.8) 
using the spacetime translation Ty). (Y, x’, t) = (Y, x! +x — xo, t' +t — to)? 


3 In order to avoid cluttered notation, I use here and in the following the same symbol Txọ—x to 
denote both the element of G and the corresponding map on M x M. 
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For any fixed x € M, the transformation of (Wi, x”) under g € Hy is determined 
by the functions Fi (W, x, g). In other words, these functions define an action of Hy 
on the slice M x {x} ~ M. This makes it possible, following now Chap. 7 verbatim, 
to introduce a standard set of field coordinates on M. Together with the spacetime 
coordinates x“, this defines a parameterization of the whole manifold M x M, in 
which the action of G takes a standard form. In order to keep the main results of 
this chapter together in one place, I will first give a concise overview of the ensuing 
standard realization of spacetime symmetry. Afterwards, I will stress some of its 
subtleties. A number of examples is worked out in detail in Sect. 12.4; the reader 
may want to consult these alongside the formal construction developed below. 


12.3.1 Summary of the Construction 


Consider the action of a continuous group G on the product manifold M x M via 
a set of point transformations of the type (12.1). It is assumed that G includes a 
set of purely spacetime transformations that act transitively on every slice {Ww} x M 
with fixed y € M. I will for simplicity choose the reference point x9 € M with 
Xo = 0 and denote as Ty = To—x the purely spacetime transformation that maps 
xo tox € M. 

The slice M x {0} ~ M carries an action of the isotropy group Ho. Choose 
a fixed point Yo € M. Assuming that the isotropy subgroup H(yg,0) is compact, 
it is always possible to find coordinates (7%, x°) on M x {0} with the following 
properties: 


e The point Wo corresponds to (°, x£) = (0, 0). Also, the set of points {(2%, 0)} 
spans a submanifold of M x {0}, equivalent to the coset space Ho/H(y,0)- 

e The coordinates 2“ parameterize uniquely a representative U (m) € Hp of the 
corresponding left coset of H(yp,0) in Ho such that U (0) = e. 

° The subgroup H(yo,0) acts on Ho/ H(yo,0) by adjoint action, U (x) > hU(x)h7! 
with h € A(y,,9), which induces a linear transformation of the coordinates m’. 

° The whole group Hg acts on M x {0} by left multiplication as 


U(r) $ U(x' (a, g0)) = goU (x)h(x, go)! , 
(12.10) 


& 
x2 S x(x, m, go) = D(A(a, 80))% 5X? » 


where go € Ho and h(z, go) € Hiyo,0). Moreover, D(h) is a matrix representa- 
tion of Hiyp,0). Altogether, the action of Ho on M x {0} is fixed by the choice 
of representation D of H(y,o) and the choice of parameterization U (7) of the 
coset space Ho/ A(yo,0)- 


This finishes the setup of coordinates and the action of Ho on the domain 
Quy X {0}; Qy is a neighborhood of yo in M where the coordinates (x1, x?) are 
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Fig. 12.2 Schematic 
visualization of the domain 
(shaded area) on the manifold 
M x M where the standard 
coordinates (2“, x8, x”) are 
well-defined. The domain 
Qyy C M carries a local 
coordinate patch with the 
field variables (z1, x£) 


well-defined. As the next step, we transport the coordinates (7%, x°) to the whole 
spacetime using the maps Tx. This leads to coordinates (7%, x}, x“) on the domain 
Quy, X M (see Fig. 12.2). Intuitively, we require that all points of the slice {y} x M 
with fixed Y € Qy, have the same coordinates 2“, x}. Technically, we set 


(f, x°, x") = 7,0", x°, 0). (12.11) 


By (12.8), this fixes the action of the isotropy group H, on M x {x}. Indeed, for any 
8x € Hy there is a unique go € Ho such that gx = Tx 807, ! This leads to 


Te, (1%, x°, x) = Ty 0 Teo (0%, x°, 0) = Tx (0 (x, 80), X° (X, 7, 80), 0) 


= (x (x, go), x, T, go), x”) : 
(12.12) 


Moreover, (12.8) guarantees that for any gy € A(wy,x), 0 € Hcw,0). Therefore, the 
isotropy subgroup H(yọ,x) is realized on the slice M x {x} by linear transformations 
of x“ and x® for any x € M. 

The structure we already have extends uniquely to an action of the entire group G 
on the entire manifold M x M, or at least on Qy, x M. Namely, note that the action 
of any g € G on a chosen point (w, x) can be composed of the action of an element 
of H, and a purely spacetime transformation, see Fig. 12.3. Indeed, decompose 


§ = UPR T PE = T> Xix, 2) 8x; g), (12.13) 


with the shorthand notation T xx = TaT, is er The group element g,(x, g) 
maps (w', x") to (F'(w, x, g), x”) for any y € M and thus belongs to Hy. It corre- 
sponds to an element go(x, g) € Ho by go(x, g) = T7! g (x, 8)Tx = T xix, e8 Tr 
Together with (12.12), this leads to the final expression for the action of G within 


Zy x M, 


Ty (x%, x2, x") = (a0 (ar, g0, 8)), x(x, go, 8)), R*(x,8)). 0214 
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Fig. 12.3 For any fixed point A 
(w,x) € M x M, the action 

of a chosen element g € G F(W,x,g) 
can be composed from 

actions of an element of Hy 

and a purely spacetime 

transformation 


This completes the setup of the nonlinear realization of G. I assumed an a priori 
knowledge of the transformation properties of the spacetime coordinates, that is the 
functions X” (x, g). With this provision, the action of G is uniquely fixed by the 
structure of the isotropy groups Ho and Ay),o), and by the representation D(h) 
of A(y,0). In the process, we made some choices, including the choice of the 
representative U (7x) parameterized by x“ and of the element Ty € G representing 
a spacetime point x € M. These are just two sides of the same coin, pertinent 
respectively to the manifolds M and M. 

Complicated as it might seem, (12.14) actually encodes in a rather simple manner 
the action of G by left matrix multiplication. Indeed, let us represent the point 
(zf, x8, x") formally by 7,(U(z), x£). Then left multiplication by any g € G 
gives 


Tg oF, (U(x), x£) = T X(x,2) o Teo, g) (U CT), x®) 
= Txa g) (U a'r, go(x, 8AE, go(x, 8)), (12.15) 
D(h(a, go(x, 8)))% 5x") » 


which exactly copies (12.14) in the matrix notation of (12.10). This shows that it is 
not actually necessary to decompose every g € G into a product of an element of 
H, and a purely spacetime transformation as in (12.13). It is sufficient to observe 
that for any x € M and g € G, Tx 8T» fixes the origin of M and thus belongs 
to Ho. 


12.3.2 Relation to Physics of Broken Spacetime Symmetry 


Mathematically, the generalization of the standard nonlinear realization from inter- 
nal to spacetime symmetry was relatively straightforward, perhaps even deceivingly 
so. A number of its features thus deserve pointing out before we proceed to 
examples. 
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Universality of the Construction In Chap.7 we found out that provided the 
isotropy group H is compact, the standard nonlinear realization captures all possible 
actions of the symmetry group G. Furthermore, I remarked that even if H is 
noncompact, the construction still goes through, but is no longer guaranteed to be 
exhaustive, provided the coset space G/H is reductive. The same applies here with 
the necessary modifications. The universality of the standard nonlinear realization is 
ensured if H(y,,9) is compact. Even if it is not, the construction is still consistent, yet 
not necessarily exhaustive, if the coset space Ho/ H(y,,0) is reductive. Reference [3] 
gives an explicit example of a group action that cannot be obtained from the standard 
nonlinear realization. There G ~ SO(2, 1) acts on M ~ R, the isotropy subgroup 
being the noncompact affine group Aff(1) ~ Rt x R. 

A possible point of concern might be that I assumed an a priori knowledge of 
the action of G on spacetime coordinates. This is however not a source of any 
hidden ambiguity. For one thing, the action of symmetry on spacetime coordinates 
is usually known. It is the classification of NG fields and their transformation prop- 
erties that we are after. Moreover, even if unknown a priori, the functions X" (x, g) 
can be constrained following the same philosophy. Namely, restricting (12.1) to 
the coordinates x” gives a group of well-defined transformations on M. If desired, 
possible forms of these transformations can be classified using the setup of Chap. 7. 


Fate of Descendant Symmetries Recall the discussion of locally equivalent 
symmetries in Sect. 11.1. As explained therein, one can assume without loss of 
generality that the functions fg (x) vanish at the origin, x“ = 0. Provided the 
localized parent transformation in (11.1) does not involve any derivatives of ef (x), 
the action of the descendant symmetry at the origin vanishes. In the standard 
nonlinear realization, such a descendant symmetry therefore automatically belongs 
to H(y,0) and does not give rise to any NG variables 7“. 


Physical Unbroken Symmetry In contrast to the internal symmetry case, a single 
point (W,x) € M x M cannot unambiguously represent the order parameter 
for SSB. The order parameter rather corresponds to a function œ! : M > M. 
A particular value of the order parameter then defines a submanifold, M, = 
{(gi (x), x”) |x € M} c M x M. Accordingly, neither Hy nor Hg(x),x) can be 
identified with the subgroup of unbroken symmetries, Hy. The latter rather consists 
of all elements of G that preserve the manifold Mọ, that is map points on Mọ 
to other points on My. With the help of (12.5), this translates to the condition 
g (K(x, g) = Fi (p(x), x, g) for all x € M. This constraint is highly implicit; 
in practice it is usually much easier to check its infinitesimal form, 


Fi (p(x), x) — X4(x)d,9' (x) =0 foralx eM. (12.16) 
The functions F y and xt define the action of the generator Q4 of G. The 


reconstruction of the group Hy from the solutions to (12.16) is based on the 
correspondence between a Lie group and its Lie algebra. An example of a possible 
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pitfall this approach may fall into, in the context of isometries of a Riemannian 
manifold, is given in Appendix A.6.2. 


Suppose that the order parameter is constant, g} (x) = pj. It follows that the 
“vacuum submanifold” is Mọ = {go} x M. According to (12.1), the unbroken 
subgroup is 


Ho = {g € G | F (yo, x, g) = gh for all x € M}. (12.17) 


Any purely spacetime symmetry automatically belongs to Hy). In other words, 
purely spacetime symmetries can only be broken by a coordinate-dependent order 
parameter. On the other hand, for internal symmetries, the condition in (12.17) 
reduces to F' (go, 2) = på. Since internal symmetries by construction do not affect 
the coordinates x”, the definitions of H(gg,x) and Hy, in this special case coincide. 
At the same time, for internal symmetries we have trivially H, ~ G for any x € M, 
hence Hy/H(go,x) ~ G/Hg,. This verifies that for internal symmetries, the more 
general formalism developed here boils down to that of Chap. 7. 


Classification of Order Parameter Fluctuations For internal symmetries, inde- 
pendent fluctuations of the order parameter are in a one-to-one correspondence with 
the NG fields z“. It would therefore be tempting to conclude that in the more general 
case of spacetime symmetries, they are classified by the coset space Ho/ H,0). The 
latter however only counts the NG variables 2“, realizing nonlinearly the action of 
Ho. Our parameterization (12.11) of the manifold M x M also involves the action of 
the “translations” Ty. Being purely spacetime, these may be spontaneously broken 
if ð ug (x) 4 0 for some x € M. We will only be able to introduce the corresponding 
NG modes in Chap. 13 once we treat yt as maps M —> M. The resulting EFT can 
be expected to feature a nontrivial realization of spacetime translations. 

It is possible to be more explicit in the special case of a constant order parameter, 
g'(x) = gi,, where purely spacetime symmetries remain unbroken. Here we expect 
a low-energy EFT in terms of NG fields 2“ on which purely spacetime symmetries 
act trivially, m” (X(x, g), g) = x“ (x). The number of these NG fields is identified 
by the dimension of Ho/ H(gy,0). This generalizes our previous counting rule for NG 
fields of broken internal symmetry. 


Consider a theory of a single real relativistic scalar field @ that is invariant under the 
Galileon transformations, 


(p, x") “4S (b,x, €), x(a, €) = (Q He + eux, x"). (12.18) 
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The symmetry group is G ~ ISO(d,1) x R?+!, where ISO(d, 1) is the D- 
dimensional Poincaré group and the RP+! factor collects the constant and linear 
shifts of @ in (12.18). Suppose the dynamics of the theory is such that the ground 
state has a constant vacuum expectation value (VEV), (@(x)) = @. This order 
parameter breaks by (12.16) the entire Galileon symmetry, that is Hy ~ ISO(d, 1). 
The coset space G/Hy ~ R?+! obviously does not correctly identify the NG 
modes. After all, our theory has a Lorentz-invariant vacuum and a single scalar 
field. Indeed, we find that Hyp ~ SO(d, 1) x R?+! where SO(d, 1) is the Lorentz 
group. Likewise, Hiy9) ~ SO(d, 1) x R?. This only includes the linear shifts of @ 
in (12.18), which act trivially at x“ = 0. In the end, we thus find Ho/ Ayo) ~ R, 
corresponding to the symmetry under constant shifts of ø. This gives the correct 
number of NG modes: one. The symmetry under linear shifts of @ is descendant. 
While it constrains the effective Lagrangian, cf. Sect. 11.3.1, it cannot affect the 
physical spectrum. 


Domain of the Standard Nonlinear Realization As emphasized rather stub- 
bornly, the standard field coordinates (7%, x°) are in principle only well-defined 
in some domain Qy, C M. In order for our formalism to be able to capture 
all fluctuations of the order parameter, the manifold My had better lie entirely 
in Qy, x M. This is obviously the case when the order parameter is constant as 
in Example 12.2. It is however not guaranteed in general and has to be checked 
case by case. In fact, it is easy to imagine a situation where this condition is very 
nontrivial. Consider a system with an order parameter gÝ (x) where the isotropy 
group A(gcx),x) varies from place to place. This is in contrast to our setup where 
Hyo,x) for all x € M are mutually isomorphic. It may happen e.g. in systems where 
the order parameter for spontaneous breaking of internal symmetry vanishes at some 
spacetime points. We might then be lucky and still get away with our standard 
nonlinear realization. The price to pay is the presence of 2“-type variables that 
locally excite gapped modes in the spectrum, or x °-type variables that locally excite 
NG modes. I am not aware of any work in the literature that would systematically 
address this kind of situation. 


Example 12.4 


A two-dimensional superfluid vortex is a map y : R? — C, expressed in the polar 
coordinates ọ, 0 on R? as W(x) = f(o)el”’. Here f (o) is a strictly increasing profile 
function such that f (0) = 0 and the limit of f (o) for ọ — œ is finite. The nonzero 
parameter n € Z is the so-called winding number of the vortex. The target manifold 
M ~ C carries an action of G ~ U(1) under which w $ ely. This is an internal 
symmetry and thus Hj) ~ G ~ U(1). For wo = 0, corresponding to the core of the 
vortex, we find Hiyo, ~ G ~ U(1) as well. There are no NG variables, only two 
matter variables x°, equivalent to the real and imaginary parts of y. These are well- 
defined in the entire complex plane, hence Qy, = C. On the other hand, choosing 
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Wo # 0 as appropriate for the vortex away from the core leads to Hiyy..o) œ {e}. 
The coset space Ho/Hyy,0) ~ UC) is now parameterized by a single NG variable 
x. The local parameterization of M ~ C is completed by one matter variable x. 
One can view x, 7 as polar coordinates on the target space. As such, their domain 
of validity Ry, can be extended at most to C with a half-line starting at the origin 
removed. 


Some Literature on the Subject Nonlinear realizations of internal symmetry were 
classified in the pioneering work [4]. First attempts to generalize their results to 
spacetime symmetries appeared soon afterwards [5,6]. These early works addressed 
a narrow class of relativistic systems where an extended symmetry such as the 
conformal symmetry is spontaneously broken to the Poincaré group. They followed 
an approach, based on an abstract coset space wherein all spontaneously broken 
symmetries induce relevant, independent degrees of freedom. Formally, this can be 
viewed as working with a set of order parameters that is sufficiently large to prevent 
any symmetries from being locally equivalent. Alternatively, it can be interpreted as 
being “agnostic” about the choice of order parameter. It is then necessary to include 
in the EFT the maximum possible set of order parameter fluctuations that can be 
enforced by a given symmetry-breaking pattern. 

The price of this agnostic nonlinear realization is that it introduces extra degrees 
of freedom that at best correspond to gapped modes in the spectrum, and at worst are 
outright unphysical. Writing down an EFT solely in terms of the physical NG modes 
requires eliminating the superfluous degrees of freedom by an operational procedure 
known as inverse Higgs constraint (IHC) [7]. In spite of its obvious shortcomings, 
this framework has influenced the narrative surrounding spontaneous breaking of 
spacetime symmetries for decades. Much of the subsequent work focused on the 
machinery of IHCs and its physical interpretation. Namely, it has been known 
for some time that fluctuations of the order parameter, corresponding to locally 
equivalent symmetries, are redundant [8,9]. Eliminating the would-be NG degrees 
of freedom corresponding to descendant symmetries by imposing a set of IHCs can 
then be interpreted as gauge-fixing [10]. The alternative possibility that the modes 
eliminated by the IHC may be physical but are necessarily gapped was recognized 
in [11, 12]. I will elaborate on the two scenarios later when we have at hand 
explicit examples. Finally, much work has been done on the question to what extent 
imposing an IHC might interfere with the universality of the standard nonlinear 
realization [13-15]. The moral is that a point transformation relating different 
parameterizations of an EFT may become nonlocal upon imposing the IHC. 

As far as I know, the fact that the would-be NG fields of descendant symmetries 
need not be included in the nonlinear realization at all was first pointed out in [16]. 
The approach developed here makes this explicit by identifying the relevant NG 
variables 2“ with coordinates on the coset space Ho/H(y,0). This has a simple 
classical interpretation, whereby z“ represent the deviation of the order parameter 
from its local value. They can be further augmented with additional NG fields, 
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arising from the spacetime variation of the order parameter. To incorporate such 
additional NG fields will be the most challenging problem we will have to face in 
Chap. 13. 


12.4 Examples 


To illustrate the general construction of Sect. 12.3, I will now work out in detail 
several examples, covering a range of different symmetries. In all the examples, I 
assume flat (relativistic or not) spacetime where the map Ty— xw is realized by the 
unique translation that moves the point x € M to x’ € M. The isotropy group Hp is 
fixed by the transformation properties of spacetime coordinates, known a priori. The 
subgroup Ay,,0), on the other hand, may depend sensitively on the choice of the 
reference point yo. This is demonstrated by the examples in Sects. 12.4.2—12.4.4. 


12.4.1 Lorentz Scalars with Internal Symmetry 


A good starting point is to check that our new algorithm can reproduce what we 
know from before about internal symmetries. Consider for simplicity a theory of a 
set of relativistic scalars y’ with a symmetry group G ~ ISO(d, 1) x Gint. Here 
ISO(d, 1) is the Poincaré group, acting as a purely spacetime symmetry, and Gint is 
an internal symmetry. The actions of the purely spacetime and internal parts of G 
are completely independent of each other. Mathematically speaking, ISO(d, 1) and 
Gint possess a well-defined action on, respectively, M ~ R? and M. Both actions 
are trivially extended to M x M by embedding. 

It follows at once that Ho ~ SO(d, 1) x Gint and Hiyo,0) ~ SO(d, 1) x Hint, 
where Hint C Gint is the isotropy group of the point Yọ € M. Unsurprisingly, we 
end up with a set of NG variables 1“, parameterizing the coset space Ho/ Heyo,0) S 
Gint/ Hin. By (12.10) and (12.14), a group element (gst., gint) € ISO(d, 1) x Gint 
acts on the standard coordinates (7%, x}, x“) via 


(8s.t.» Zint) 
D 


U(r) U (7' (xr, gint)) = 8imU Thit, gin)! , 


(8s.t.»8int) 
x2 EEN, ACX, T, gin) = D (him (T, 8in))?o X? » (12.19) 


yh EBD, WC, gee) = (Top 0)" - 


Here hint € Hint and D (hint) is a matrix representation of Hint. Finally, T,,. defines 
the action of the Poincaré group on the Minkowski spacetime. This is of course 
well-known, so the abstract notation just takes explicitly into account the possibility 
of using other coordinates x” than Minkowski. 
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This might look like a mere idiosyncratic reformulation of the standard 
nonlinear realization of internal symmetry as laid out in Chap. 7. Indeed, 
should the order parameter be constant, g! (x) = ph, the unbroken subgroup 
is Ho ~ ISO(d, 1) x Hin so that G/Hg, ~ Gint/ Hin, in accord with Exam- 
ple 12.2. However, the parameterization of M x M by (mf, x}, x”) and 
the corresponding group action (12.19) are also valid for order parameters 
gÏ (x) with arbitrary coordinate dependence. The true unbroken subgroup Hy 
can then be smaller than ISO(d, 1) x Hint and in extreme cases even trivial. 
This is not mere pedantry. For instance, dense relativistic matter can often 
be described by a time-dependent condensate of scalar fields. Although such 
a background spontaneously breaks boosts, our construction guarantees that 
one can parameterize its fluctuations by the same degrees of freedom as in a 
Lorentz-invariant vacuum. A similar remark applies to all the other examples 
discussed below. 


The same setup can be used even if we replace ISO(d, 1) with any other purely 
spacetime symmetry group that contains spacetime translations. The message is 
fairly simple: it does not matter what the spacetime symmetry is, as long as it does 
not act directly on the fields. 


12.4.2 Lorentz Scalars with Scale Invariance 


Next, we look at a simple example of a symmetry whose actions on coordinates 
and fields cannot be trivially separated. Consider a set of relativistic scalar fields 
w', carrying the action of G ~ R* x ISO(d, 1). Here ISO(d, 1) is again the 
Poincaré group. What is new is the factor R*, representing scale transformations of 
the spacetime coordinates. It would be possible to add an internal symmetry factor 
Gint in the same way as in Sect. 12.4.1, but I will not do so to keep the notation 
simple. 

It is now convenient to fix the spacetime coordinates x“ as the standard 
Minkowski ones. The maps T% are then realized explicitly as Ty = e”’? where 
P, is the generator of spacetime translations. The action of translations on the 
Minkowski coordinates is x” (x, €) = e€? x” = x” + e”. Likewise, an element 
e” e R* with real parameter œ is realized on the coordinates as x” (x,œ) = 
elt x4 — ex", where D is the dilatation operator. Accordingly, the action of 
the dilatation group R™ on the Poincaré group is fixed by the commutation relations 
[D, Pu] = —iP, and [D, Jw] = 0. 

It is now clear that Hy ~ Rt x SO(d, 1). Since Lorentz transformations act 
trivially on scalar fields, we have only two options for H(yọ,0), depending on 
whether or not Wo also preserves scale invariance. Let us start with H(y,0) ~ Rt x 
SO(d, 1), which corresponds to a scale-invariant order parameter wo € M. This 
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implies Ho/H(y,0) = {e}, hence there are no NG variables x1, only matter fields. 
These should carry a linear representation of the dilatation symmetry. Since R* only 
has one-dimensional irreducible representations, we can always choose a basis x° 
of coordinates on M x {0} such that x’? (x, œ) = el? x? = exp(—aA,)x2.4 The 
parameter A, is the scaling dimension of x°. The action of dilatations on the entire 
manifold M x M ~ M x RP is then given by 


(x2, x") S Xex, a), x” (x, @)) = (exp(—aAg) x2, e* x") . (12.20) 


It is instructive to see how this transformation rule is reproduced by (12.15). Namely, 
the commutator [D, P,,] = —iP, together with the Hadamard lemma (7.30) gives 
go? Pe =e"P, or eD elie PeieD — exn(ie%e - P). (12.21) 
This ensures that for g = eP, go(x,g) = eP for any x € M. The scale 
transformation of x? is independent of the spacetime coordinate as it should. 
Mathematically, the conjugation relation (12.21) boils down to the fact that the 
translation generators P, carry a representation of the dilatation group RT. 

The other option for the isotropy group is Heyo, ~ SO(d, 1), in which case 
Ho/ Hyo, ~ R. This can be thought of as arising from a Lorentz-invariant but 
dimensionful order parameter. We now have one NG variable, the dilaton x, plus 
possibly a set of dim M-— 1 matter fields x}. With the exponential parameterization, 
U(r) = e"? , (12.10) tells us that 2/(z, a) = x + «œ and x(x, 7, œ) = x2. The 
extension of the action to other spacetime points works exactly the same as in the 
previous case. The final result for the action of dilatations therefore is 


(r, x8, x") S (n'r, a), x2 (x, 1, œ), x (x, @)) = (m +a, x8, eX x) . 
(12.22) 


Note that the scaling dimension of all the matter fields is now zero. This is just a 
matter of a choice of variables. Namely, in presence of the dilaton, any field Y? with 
scaling dimension A, can be redefined to x° = exp(Agz) VW. 

I have not spelled out explicitly the action of spacetime translations and rotations. 
However, these are purely spacetime transformations and only affect the spacetime 
coordinates, similarly to the last line of (12.19). 


12.4.3 Lorentz Vector with(out) Lorentz Scalar 


Another possibility how to make the action of symmetry on coordinates and fields 
entangled is to keep the Poincaré group G ~ ISO(d, 1), but take a nonscalar field. 


4 From now until the end of Chap. 12, a repeated index ọ does not imply any summation. 
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Consider for simplicity a single Lorentz-vector field, A”, defining a D-dimensional 
target manifold M ~ R?. We then have Hy ~ SO(d, 1), but the isotropy group 
H(4,,0) depends on the choice of Ab . There are four qualitatively different options: 
Ab = 0, or nonvanishing AG that is respectively timelike, lightlike, or spacelike. 

In the simplest case of Ab = 0, we find A(4,,0) œ~ Ho ~ SO(d,1) and 
consequently Ho/H(Ay,0) ~ {e}. There are no NG variables and the sole degree 
of freedom, A” itself, is of the matter type. It carries the vector representation of 
the Lorentz group, A“ — A’“(A, g) = A(g)",,A” for any g € SO(d, 1). Since the 
translation generators P, also carry the vector representation of the Lorentz group, 
we have a conjugation relation similar to (12.21) for dilatations. This guarantees 
that go(x, g) = g for any g € SO(d, 1) and x € M ~ R?. From (12.15), we then 
extract the expected result for the action of Lorentz transformations, 


To (A*, x") = (A (8), A”, A(g)" x”) , g € SO(d, 1). (12.23) 


Spacetime translations only affect the coordinates, x'” (x, €) = e€ Px! = x” + e”. 

For illustration, I will work out in detail one more special case. Suppose that A’ 
is nonzero and timelike. Without loss of generality, we can assume that Ao = asi 
with a Æ 0. Then H(4,,0 ~ SO(d), the group of spatial rotations. The coset space 
Ho/A(Ay,0) ~ SO(d, 1)/SO(d) is now a noncompact d-dimensional manifold, best 
viewed as the mass shell of a massive relativistic particle. A convenient implicit way 
to parameterize it is by treating it as the d-dimensional hyperboloid in M ~ R? 
satisfying the constraint A,A” = a’. That has the advantage of maintaining the 
linear transformation property (12.23) under all Lorentz transformations. This is just 
the noncompact version of the SO(d + 1)/SO(d) ~ S? coset space, conventionally 
parameterized by a unit vector n € R¢*!. 

If needed, it is possible to introduce explicit coordinates x” on Ho/H(4y,0), trans- 
forming linearly as a vector under H(4,),9) ~ SO(d). One suitable coordinatization 
arises from thinking of A” as the energy-momentum of a particle of mass |a|. The 
d independent coordinates x” are then the components of its spatial momentum. A 
pure Lorentz boost can be represented by e!”"* , where K is the boost generator and 
n the rapidity. The latter is parallel to the velocity v of the boost; its magnitude is 
determined implicitly by |v| = tanh |ņ|. The action of the boost on z” in terms of 
the velocity v reads 


’ Yw—-1 7 7 1 
„v) =m + . + + ; E oo 
m (m, v) =T Nç) (v -m)v + yva m4 sgna Vov Jae 
(12.24) 


The parameterization of the target manifold M ~ RP is completed by adding to 2’ 
a matter field x, transforming in the singlet representation of SO(d). By (12.15), 
it then automatically transforms trivially under the whole Poincaré group. The 
transformation of coordinates x” remains of course the same as in (12.23). 
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From internal symmetry, we are already used to the fact that the number and 
type of matter fields x? present depend on the choice of order parameter. It 
is only the NG fields x° that are fixed by the symmetry-breaking pattern. 
Here comes the surprise: for spacetime symmetries, even the number of 
NG variables 7“ may depend on the order parameter. For an example, take 
a theory of a relativistic complex scalar field ø where an internal U(1) 
symmetry is broken by the VEV (¢(x)) = g(x) = poe it with constant 
nonzero go and u. This state describes matter with nonzero density of the 
U(1) charge; u is the corresponding chemical potential. The state breaks the 
G ~ ISO(d, 1) x U(1) symmetry spontaneously down to Hy ~ ISO(d) x R, 
where ISO(d) now includes spatial translations and rotations and R stands 
for a combination of time translations and internal U(1) transformations. 
In line with our discussion in Sect. 12.4.1, there is a single NG variable, 
parameterizing the coset space Ho/H(y),0) ~ [SO(d, 1) x U(1)]/SO@d, 1). 
Now add a vector A” as a secondary order parameter. A constant background, 
(A*(x)) = Ao = ad", can be interpreted for instance as the VEV of the 
current of the U(1) symmetry. This does not affect the unbroken subgroup Hy, 
yet it does add a vector x” of NG variables. We will deal with this puzzle in 
Chap. 13. It will turn out that the number of physical gapless NG modes in the 
spectrum is still uniquely fixed by the symmetry-breaking pattern G > Hoy. 
The new variables x” represent either spurious, nondynamical degrees of 
freedom or dynamical but gapped excitations of the order parameter. 


12.4.4 Schrédinger Scalars with Galilei Symmetry 


To work out at least one nonrelativistic example, we now return to Galilei invariance. 
The action of the Galilei group on spacetime coordinates is well-known. A spatial 
translation by € shifts Cartesian coordinates by x — x + e€, and likewise t > t +€ 
represents a temporal translation by e. Accordingly, the translation operator Ty can 
be parameterized as Ty, = e“ eP , where H and P are the respective generators. 
A Galilei boost with velocity v acts on the coordinates as (x, t) > (x + vt, t), and 
will be represented by e!”* with K being the generator. One should also add the 
generator of spatial rotations, Js. However, since we will be mostly concerned with 
the boosts, I will not spell out the action of rotations explicitly. 

From the transformation of the coordinates, we extract the conjugation property 


eH Ke 4 = K—eP or [H,K]=iP. (12.25) 
The actions of spatial translations and boosts on the coordinates commute with each 


other. However, it is known that representations of the Galilei symmetry admit a 
central extension of the commutator [P,, K,]. Let us therefore introduce a tentative 
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central charge Q so that? 
[P,, Ks] =id;;0 or e€? Ke? = K-€QO. (12.26) 


The resulting central extension of the Galilei group is known as the Bargmann 
group. This has the structure G ~ SO(d) x {R} x [R? x U(1)o]}. The first factor 
stands for spatial rotations, R? for spacetime translations, and RI for boosts. 
Finally, I use the notation U(1)ọ for the transformations generated by Q although 
at this stage it is not clear, or even important, that the symmetry is compact. 
What matters is that the action of Q on the coordinates is trivial. This ensures 
Ho ~ [SO(d) x R4] x U()o. 

The isotropy group H(yg,0) depends sensitively on the type of fields included. I 
will initially restrict to rotation scalars, w', which makes the rotation SO(d) into 
a purely spacetime symmetry. Moreover, the Re group of Galilei boosts is now 
descendant, as we saw in Sect. 11.3.3. This is confirmed by an explicit classification 
of low-spin indecomposable representations of the Galilei group [18]. In the end, 
the only part of the symmetry whose action on y’ at the origin, (x,t) = (0,0), 
may be nontrivial is the U(1)g. We have a freedom to decide whether or not U(1)g 
belongs to H(y,,0) by a suitable choice of the reference point Yå. 

Let us first assume that Hy 0) ~ Ho so that Ho/ Hiyo, ~ {e}. It is then 
possible to find a set of complex field coordinates x? that form one-dimensional 
linear representations of U(1)ọ, x(x, œ) = eye — exp(iamg)x°, where mo 
are the corresponding charges of Q. This simple transformation property survives at 
all (x, t) since Q is a central charge and so commutes with spacetime translations. 
Likewise, all of the spacetime translations and spatial rotations will act solely on 
the coordinates. The only nontrivial piece is the action of Galilei boosts. This is 
determined with the help of (12.15) and the relation 


elt: K etH oix- P = et” piatur)-P exp i (» x + z1) o| elt K (12 27) 
2 i ` 
which follows from (12.25) and (12.26). The final result is 
eK (X8, x, t) = (explimo(v : x + v°t/2)]x°, x + vt, t) , (12.28) 
which reproduces using solely the Lie algebra of the Bargmann group the transfor- 
mation rule (12.9) I simply postulated before. We can see that the central charge 


Q, introduced above ad hoc, measures the nonrelativistic (rest) mass. In systems of 
identical particles of a fixed mass, this is proportional to the number of particles. 


5In d = 2 spatial dimensions, the Galilei group admits another, exotic central extension, 
[K;, Ks] = ixé;s. The parameter « is however related to two-dimensional spin [17], and I will 
thus drop it. 
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The other option for the isotropy group is H(y.,0) ~ SO(d) x Re. This implies 
that Ho/ Ayo) ~ UC)g. The particle number symmetry U(1)g is going to 
be realized nonlinearly as for instance in superfluids. We need one NG variable, 
a. With the exponential parameterization of the coset space, U (r) = e2, this 
transforms under U(1)g as 2’(z, œ) = x +a. In addition, there will be a set of real 
matter fields x°, invariant under U(1)g. Using again (12.27), we then find quite a 
different result then above, 


eK, x2, x, t) = (m +v x + u't/2, x°,x4+ vt, t). (12.29) 


This completes the range of options accessible with scalar fields. Let us see at 
least briefly what may happen when higher-spin fields are present. For simplicity, 
consider a single multiplet A” = (A?, A) that transforms under boosts as a Galilean 
vector [18], A” (A, v) = e™ £ (A?, A) = (A?, A + vA®). Similarly to Sect. 12.4.3, 
one can think of (A“(x)) = Ao (x) as the VEV of the density and current of the 
U()o symmetry, respectively. In this interpretation, a state with Ao (x) = (a, 0) 
and constant a + 0 corresponds to uniform dense matter in its rest frame. 
With this secondary order parameter, the isotropy group Hijo, 49),0) is reduced to 
SO(d) x U(1)9 or SO(d), depending on whether or not the primary order parameter 
Wo preserves U(1)g. The coset space Ho/ AH ((yo,Ao),0) then necessarily includes a 
noncompact d-dimensional submanifold R, carrying a vector of NG variables, 
E". The natural parameterization is U (£) = e'6'* . Following our algorithm for the 
standard nonlinear realization, we find that (12.28) and (12.29) remain valid under 
the respective assumptions on U(1)g. The only change is a new transformation rule 
for £", 


U(E (E, v)) =e” * UE) = UE +v). (12.30) 


The modification of the realization of the Bargmann group by adding £” is nearly 
trivial, so why exactly have we done this? The reason will become clear in the next 
chapter where we address the problem of construction of invariant actions. Namely, 
invariance under the linearly realized isotropy subgroup Hy,,,0) has to be ensured by 
brute force using representation theory. The mathematical structure of the Bargmann 
group makes this a difficult task. With Galilei boosts realized nonlinearly by the NG 
field £”, all that remains is to impose invariance under spatial rotations and possibly 
U(1)Q, which is straightforward. 
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| Check for 
updates 


With the nonlinear realization of spacetime symmetry put forward in Chap. 12, 
we are now in the position to construct the low-energy effective field theory 
(EFT). As I already occasionally stressed, the EFT machinery for spontaneously 
broken spacetime symmetry is less developed than that for internal symmetry. As 
a consequence, I will not be able to offer an explicit general expression for the 
effective Lagrangian akin to what I did in Chap. 8. Instead, I will describe the basic 
framework in an algorithmic fashion, and then work out some illustrative examples. 

In this chapter, I only consider systems where the symmetry group G is known 
and dictated by the microscopic dynamics. This has been the underlying assumption 
of the whole book so far. Yet, there are physical systems whose low-energy physics 
may feature additional symmetry which itself may but need not be spontaneously 
broken. Such symmetry is usually called emergent, and appears among others in 
classical systems such as fluids or solids. I will defer a detailed discussion of such 
emergent symmetries and their spontaneous breaking to the next chapter. 

Following some preliminary work in Sect. 13.1, the survey of EFTs for sponta- 
neously broken spacetime symmetry starts in Sect. 13.2. Here I deal with systems 
where the values of the order parameter at all points in space belong to the same 
orbit of the symmetry group. This requires a minimal modification of the setup 
for internal symmetries, but already involves some of the subtleties associated 
with the spacetime ones. A major issue is that the number of Nambu—Goldstone 
(NG) variables 2“ depends on the choice of order parameter realizing the given 
symmetry-breaking pattern. This means that some would-be NG variables are not 
required by the symmetry-breaking pattern per se. In Sect. 12.3.2, I suggested that 
such fields may, but need not, be physical. In case they are, they necessarily excite 
gapped modes in the spectrum, not true NG modes. I illustrate this dichotomy by 
several examples in Sect. 13.3. Then I offer some comments on the inverse Higgs 
constraints (IHCs) as an operational prescription for eliminating such spurious 
degrees of freedom. 
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The generic case of systems where the order parameter may depend on coordi- 
nates in an a priori arbitrary manner is considered in Sect. 13.4. Here the methods 
available for the construction of EFT appear to be least developed. I will therefore 
have to content myself with a careful discussion of the special case of one- 
dimensional order parameter modulation, augmented with a couple of illustrative 
examples. 


13.1 Building Blocks for Construction of Effective Actions 


In Chap. 8, I showed that the effective action for NG bosons of spontaneously broken 
internal symmetry can always be expressed in terms of the associated Maurer- 
Cartan (MC) form. I am not aware of any proof of the equivalent statement for 
spacetime symmetries. I will however make the common assumption that this is 
still the case. 


13.1.1 Maurer—Cartan Form 


Recall that we deal generally with a set of bosonic fields y’, taking values from 
a target manifold M. The spacetime is treated as a possibly curved manifold M 
of dimension D, parameterized by local coordinates x“. Together, yÍ and x” span 
the product manifold M x M. In order to avoid excessive repetition, I will not 
review all the details of the standard nonlinear realization of spacetime symmetry. 
The reader is invited to consult Sect. 12.3 to refresh their memory. Let me just stress 
the important roles of the isotropy groups Hy of a spacetime point x € M, and 
Hoy x) of the point (Y, x) € M x M. For chosen fixed yo € M, the coset space 
Ho/ Hop,0) Spans a submanifold of Mx {0} ~ M, parameterized by NG coordinates 
mz“. These are generally accompanied by a set of matter variables x}. Together, 
(zf, x£, x”) constitute of a complete set of local coordinates on M x M. 

Let us first focus on the variables (7°, x”), required for a successful nonlinear 
realization of spacetime symmetry. I will represent these jointly as 


Ua, x) =T U (7), (13.1) 
where Ty is a fixed purely spacetime transformation that transports the spacetime 
origin to the point x (cf. Sect. 12.2). Moreover, U (7x) is a matrix representative of 


an element of the coset space Ho/Hyo,0). By (12.15), the action of any g € G on 
(zf, x") can then be expressed compactly as 


UC, x) > Ur" (or, go, 8)), R(x, 8)) = gU, xA, go, g), 13.2) 
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where go(x, g) = Tx, )8Tx- Moreover, 2“ (zr, go(x, g)) and h(z, go(x, g)) are 
defined by (12.10). The MC form can now be introduced through 


olr, x) = —iU(r, x) dU, x) 
(13.3) 
= iU (a) 'dU (x) + U (a) t (iT; dT) U (7) . 


The first piece that depends solely on x“ is familiar from our analysis of internal 
symmetries. The second piece is new and in some, as yet unclear, way reflects the 
action of the symmetry on the spacetime. Equation (13.2) induces the following 
transformation of the MC form, 


olr, x) > olr, g0), X(x, 8)) 


= h(x, g0)@ (r, x)h (r, go) | — ihr, go)dhCr, go) |, 


(13.4) 


where I for brevity dropped the arguments of go(x, g). 

In order to understand better the structure of the MC form, we need to make a 
digression. By definition of spacetime symmetry, the transformation of coordinates 
x is independent of the field variables yÝ (or 2“, x2). One can thus restrict 
the action of G to the spacetime manifold, where it is defined by the maps 
x! => x’! = X” (x, g). Under this restricted action, the spacetime behaves as a 
homogeneous space, M ~ G/ Ho. This implies the following local homeomorphism 
of manifolds, 


G > Aio,0) x Ho/ Aa,0) X G/Ho > AHeo,0) X Ho/Aayo,0) X M . (13.5) 


With the obvious notation for the Lie algebras of the three groups G, Ho and Hyp ,0), 
the corresponding isomorphism of tangent spaces reads 


BX Deyo,0) B Do/Deyo,0) © 9/bo - (13.6) 


Here }o/D(yo,0) and g/Ho should be viewed as shorthand notation for the respective 
tangent spaces; these are themselves not Lie algebras without further qualifications. 

I previously assumed that at the very least, the coset space Ho/H(yo,0) is 
reductive. This ensures that both By,,0) and Do /Byo,0) are invariant subspaces under 
the adjoint action of H(y,,0). In the following, I will need a stronger assumption, 
namely that all three components of the direct sum (13.6) are invariant under the 
adjoint action of H(y,,0). This is guaranteed if H(y,,0) is compact, but in general has 
to be viewed as an additional assumption. 


Example 13.1 


Let us check the validity of this assumption for the examples of nonlinear realization 
worked out in Sect. 12.4. In case of a theory of Lorentz scalars with an internal 
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symmetry group Gint (Sect. 12.4.1), we have Hiy,,0) ~ SO(d, 1) x Hint. As long as 
the coset space Gint/ Hint itself is reductive, the tangent space )o/D(yo,0) © Sint/ hint 
is naturally invariant under the adjoint action of H(y,,0). Likewise, the tangent space 
g/bo of G/Ho ~ M ~ RP carries the vector representation of SO(d, 1). It is easy 
to check that also for the examples in Sects. 12.4.2 and 12.4.3, our assumption on 
the decomposition (13.6) is satisfied. 

The case of Galilei symmetry (Sect. 12.4.4) is most nontrivial. Here the commu- 
tation relation [P,, Ks] = iô-s Q implies that the d-dimensional space spanned on 
the components of the momentum operator does not carry a representation of Galilei 
boosts. Hence the boosts should not be included in the isotropy subgroup Ay, ,0). 
The way out, as shown in Sect. 12.4.4, is to introduce a vector order parameter A” 
that gives a vector of NG variables £”, carrying a nonlinear realization of the boosts. 
Then, Hyo, Ap),0) is either SO(d) or SO(d) x U(1)g. In both cases, the Lie algebra 
g of G naturally splits into invariant subspaces of H((yo,Ao),0) as in (13.6). This 
observation lends further support to the mathematical realization of Galilei boosts 
in terms of the auxiliary vector of NG fields £”. I will return to this in Sect. 13.3.3. 


In order to reformulate the above basic assumption in a language more common 
in physics, we need a notation for the symmetry generators. In analogy with the 
treatment of internal symmetries in Chap.7, I will denote generators that form a 
basis of g as Q 4,B,.... A subset of these generators that spans a basis of Hy, 9) will 
be Qa,g,.... Analogously, the basis of ho /D(yp,0) will be Qz.p,.... Finally, for the basis 
of the complementary space g/g, I will use the notation P,.y,.... In the fixed basis 
Q 4, the structure of the Lie algebra g is defined by the structure constants fsz via 
[QOa, QB] =i ie Qc. Then, our assumption that all the components of the direct 
sum (13.6) carry a representation of H(y,,0) restricts the commutators of Qg to 


[Qu, Osl=iffgQy, [Qa,Qol=iff,Qc, (Qa, Pul=ifguPv. (13.7) 


The remaining commutators of the Lie algebra g take the generic form 


lOa, Ob] =i} Qy + f5,0c), (Qa, Pal =i f8,O6 + f Qo + SEP), 
[Pu Pili Qa + ie tro (13.8) 


There is no f4, P, term in [Qa, Qp] since all the Qa and Qa together span a basis 
of ho. As to Py, at this stage we do not need to assume that they are mutually 
commuting translation generators. 

We are now ready to return to the MC form. By its definition (13.3), œ = @4Q4 
is a (locally defined) 1-form on Ho/H(yp,0) x M that takes values in the Lie algebra 
g of G. It consists of three parts, corresponding to the three spaces on the right-hand 
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side of (13.6), 


O=O| +L tor, 
K a ve (13.9) 
w =o" Qa , 0. =0°Q,, wp =Z e= P. 


The notation for wj and w, follows the conventions introduced in Chap. 7 for 
internal symmetries. The wp component, proportional to the generators P,,, is new 
here. Thanks to the assumption that both spaces ho/D(y),0) and g/bo are invariant 
under the adjoint action of H(y,,0), the transformation rule (13.4) splits as 


wjr, x) £ h(x, goJo, x)A(z, go) | — ihr, go)dh(x, go)’ , 
w(t, x) Š hi, goo. (r, Xh, go)! , (13.10) 
wp (1, x) È h(x, gop (r, xh, go)! . 


Following Chap. 7, it would now be possible to work out further technical details. 
These include for instance an explicit expression for the action of G on the 
coordinates (xf, x"), or the MC equations for the exterior derivative of w(z, x). 
I will however not do so even though it would be straightforward, for we will not 
need these details. Let us instead turn to the physical interpretation of the various 
parts of the MC form. 


13.1.2 Covariant Derivatives of Fields 


The first part of the MC form in (13.3), —iU ()~!dU (x), obviously takes values 
from the Lie algebra bg. It follows that wp does not contain any contributions 
proportional to dz“. In other words, wp is a well-defined 1-form on the slice 
{x} x M ~ M for any fixed point 2° in Ho/ H(y,0), 


e(n, x) = es (a, x)dx” . (13.11) 


By assumption, the action of G on the spacetime M is transitive. This implies that at 
x“ = 0, the components of wp (0, x) = (—17, LAIT; x)“ P, span the whole cotangent 
space to M. By continuity, the same must be true in some neighborhood of the 
origin of Ho/Hyo,0), 7“ = 0. We conclude that the 1-forms e*4(z, x) with fixed 
wz define a (local) coframe on the spacetime manifold M. We are of course free to 
use any (local) frame or coframe on M we wish. Nevertheless, it will be convenient 
to stick to wp thanks to its covariant transformation properties under G, as shown 
in (13.10). This justifies a posteriori the notation e*“ for the components of wp. 
Next we have a look at the w part of the MC form, 


wo (7, x) = wf (1d? + of Cr, x)dx" . (13.12) 
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The wf piece comes from the —iU(a)~'dU(x) term in (13.3) and is thus 
independent of the spacetime coordinates. The full 1-form œw; however contains 
components from cotangent spaces to both Ho/H(y,.,0) and M. Unlike in the case 
of internal symmetries, it therefore cannot be interpreted as giving rise to a coframe 
on Ho/ Hapo,0)- 

At this point, recall that we eventually want to treat x“ as fields. These naturally 
define maps from M to Ho/Hy,,0) that take any x € M to r“(x) € Ho/Aiyo,0)- 
Slightly more formally, one can view the fields as maps from M to Ho/H(yo,0) x M 
that assign to x e M the pair (f(x), x”). This makes it possible to pull 
differential forms on Ho/ H(jo,0) x M back to M. Thus, the coframe (13.11) becomes 
e“E(r(x), x) = ġa (x), x)dx’.! More interesting is the pull-back of (13.12) to 
M, 


œ (a(x), x) = [wp (1 (x)) 8,70" (x) + wa (1 (x), x)]dx” . (13.13) 


This obviously carries information about the derivatives of the NG fields 1“ (x). To 
extract the covariant derivative V,7“ of the NG field, we decompose the spacetime 
1-form w“ (z (x), x) in the coframe e*“(z(x), x), 


wf = (Vu )e** = (Vy )es#dx” . (13.14) 
Comparison with (13.13) then gives 
Vr" (x) = [of (a (x)) Aw? (x) + 8 (xe (x), xe (T(x), x) (13.15) 


where e, (7, x) = e, (T, x)ð is the local frame on M, dual to e*“(z, x). The cova- 
riant derivative can also be expressed compactly as Vur’ = œf (eu). 


I am using here the same symbol V, for the covariant derivative as in 
Appendix A.5, yet the two objects do not seem to be the same. Let me clarify 
the difference. First, the first term on the right-hand side of (13.15) is not 
just the gradient of x° projected to the frame e,. The extra factor œ}, as 
well as the wf term, is needed to ensure covariance under the action of G. In 
particular the œ; factor is nontrivial even if G is an internal symmetry group. 
Second, we are not introducing here an all-purpose connection on M that 
would make V,,7 covariant under the maximal structure group GL(dim M). 
As follows from (13.10), we only allow changes of the local frame induced 
by a representation of the structure group H(y,,0). This procedure is designed 


(continued) 


lI take the liberty to denote 1-forms on Ho/ Hy,0) X M and their pull-backs to M using the same 
symbol. It should be clear from the context which of the two is meant. 
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to give a G-invariant EFT for the NG fields. In spite of using covariant 
derivatives, the EFTs constructed below are not covariant under general 
coordinate transformations on M. 


The remaining part of the MC form (13.9) that we have not discussed yet is œj. 
This, as the transformation rule (13.10) suggests, plays the role of an )(y,),9)-valued 
connection. Let us define the spacetime components of the connection after pull- 
back to M through 


wrx), x) = w (1 (x), x)Qadx” . (13.16) 


Then for a matter field x° that transforms linearly in a representation D of H(yo,0), 
cf. (12.10), the covariant derivative can be defined as 


Vax! (x) = [dox Œ) + iw} a), x)D(Oa)®g x? @) Jen (wr), x). (13.17) 


This generalizes the covariant derivative we constructed in the context of internal 
symmetries in Sect. 8.2.4. 

We conclude by checking in what precise sense the derivatives (13.15) 
and (13.17) of the NG and matter fields are covariant. To that end, consider 
a transformation by an element of g e€ G infinitesimally close to unity, 
g ~ e+ie4Q,, where e^ is a set of small constant parameters. The element 
h(x, go(x, g)) € How, defined by (12.10) with go(x, g) = Tye pets then 
reads, to linear order in €”, 


A(z, go(x, g)) © e + ief k% (1, x) Qo - (13.18) 


Here k% (r, x) is a set of functions that are calculable following the guidance of 
Sect. 7.3, but we need not do so explicitly. It is easy to check that the last two lines 
of (13.10) now translate into the infinitesimal transformations 


wt (T, x) = —€4k% (x, x) ffo (7, x) , 
(13.19) 
Se" (or, x) = —eARY (r, x) fH e(r, x) . 


By the duality between the frame e,, and the coframe e*, the transformation of the 
latter gives $e, = Eka fuev- Equipped with these auxiliary identities, we arrive 
at the infinitesimal transformations of the covariant derivatives (13.15) and (13.17), 


8(Vym") = ARG (— fa, Van? + fL Vyr"), 


(13.20) 
S(Vu Xe) = €AKS[ID(Ou)?g Vx? + FE,VvX2) - 
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The covariant derivatives transform linearly. This makes it possible to construct 
G-invariant Lagrangian densities out of the (possibly higher) covariant derivatives 
using standard methods of tensor algebra and representation theory. 


Recall the example worked out in Sect. 12.4.2. Here G ~ Rt x ISO(d, 1) acts 
on a set of relativistic scalars, where ISO(d, 1) is the Poincaré group and Rt 
represents scale transformations of spacetime coordinates. Suppose that the ground 
state of the system carries a uniform condensate, spontaneously breaking the scale 
invariance. We choose accordingly wo € M that breaks the dilatation group Rt. 
Thus, H(y,0) ~ SO(d, 1) whereas Ho ~ Rt x SO(d, 1). 

Let us disregard possible matter fields and focus on the dilaton field z, 
parameterizing the coset space Ho/ Ay ,0) ~ R. We fix x“ to be the standard 
Minkowski coordinates, and P, the energy-momentum operator. The manifold 
Ho/ Howo X M > R¢+? is naturally parameterized by U(r, x) = e Pelt? 2 
With the help of the commutation relation [D, Pu] = —iPy, we compute the MC 
form 


olr, x) = iU, x) tdu, x) = Ddr + e™™ P -dx . (13.21) 


From the second term, we extract the spacetime coframe e*4 = 54e~* dx” and the 
corresponding dual frame, e,, = one dp. The dual frame can be used as a covariant 
derivative operator, V,,, that is scale-invariant. 

To produce an action, we still need a volume element. This is most easily 
obtained by constructing a volume form on the Minkowski spacetime using the 
coframe, 


vol = e? A... A e*l =e tT G9 pn. A dxf. (13.22) 


A generic Poincaré- and scale-invariant effective action for the dilaton then reads 
Seter} = pots e Gt 2 e(Vr, VY, ...). (13.23) 


The Lagrangian -Zee can be built by writing down the most general Lorentz- 
invariant operator in terms of the derivatives of x, and then replacing ð, with V,, 
everywhere. 


? In order to avoid notation clash with the dilatation operator D, I temporarily denote the dimension 
of spacetime as d + 1. 
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13.2 Twisting Order Parameter for Internal Symmetry 


We have reached the point where we can start dealing with concrete systems 
featuring spontaneous breakdown of spacetime symmetry. As mentioned above, I 
will always assume that the effective action can be constructed solely out of the MC 
form. Any claim of having found the most general invariant action therefore has 
to be interpreted accordingly. It is not always straightforward to construct the most 
general effective action even in this restricted sense. To make further progress, it is 
convenient to narrow down the landscape of systems that we consider. 

In this section, I will elaborate on the example of nonlinear realization worked out 
in Sect. 12.4.1. Here is a brief reminder. Suppose that the symmetry group factorizes 
as G ~ Gst. X Gin, where “s.t” and “int” stand respectively for “spacetime” 
and “internal.” Moreover, suppose that the target manifold M is parameterized 
by a set of fields yi that are scalar in the sense that Fi (y, X, 8st.) = yi for all 
8st. € Gs... This is equivalent to requiring that Gs.t, is a purely spacetime symmetry. 
Likewise, Gint is internal in the sense that F' (W, X, Zint) is independent of x” and 
X” (xX, Zint) = x” for all Zint € Gint. The action of the symmetry group G on the 
standard coordinates (7%, x°, x) is then given by (12.19). The isotropy group 
of the spacetime origin is Hp ~ Hst. X Gint, and the spacetime manifold M is 
equivalent to the homogeneous space Gs.t./Hs.t.. Similarly, the isotropy group of 
(Wo, 0) € M x M is Ags. x Him, hence Ho/ A ayy,0) ~ Gint/ Aint. 

I will now make one additional, important assumption. Let us choose the 
reference point wo € M so that it represents the actual value of the order parameter 
at the spacetime origin, (y/(0)) = Wi. Suppose that in the resulting standard 
coordinates on M, the vacuum expectation value (VEV) of all the matter fields, 
(x£ (x)), is zero. This is equivalent to requiring that the order parameter is fully 
specified by the VEVs (z“(x)). The values of the order parameter at different 
spacetime points lie on the same orbit of Gint in M. It is then mathematically 
consistent to eliminate the matter fields from the EFT altogether. After all, they 
are expected to excite gapped modes in the spectrum, which can be ignored at 
sufficiently low energies. 


Example 13.3 


Consider a theory of a single real relativistic scalar field ¢, equipped with an internal 
shift symmetry, (¢, x") $ (ġ +€, x”). This corresponds to Gst. ~ ISO(d, 1), 
Hst. ~ SO(d, 1), and Gin ~ R. For any choice of the reference point ġo, the 
internal isotropy group is trivial, Hint ~ {e}. In this case, the only degree of freedom, 
that is @ itself, is of the NG type. There are no matter-type variables. Therefore, our 
assumption that the VEV of all x? be vanishing is trivially satisfied. 

A slightly less trivial example is that of a complex relativistic scalar ¢, subject to 
the action of Gint ~ UC) via (¢, x”) = (eo, x). For any nonzero ġo € C, we 
have again Hint ~ {e}. The field is then naturally parameterized by its modulus and 
phase, ¢ = ge’, where 6 is the sole NG variable. Provided (o(x)) = Qo is constant, 
o can be traded for a matter field with vanishing VEV as desired, x = ọ— ọọ. On the 
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other hand, the VEV of the phase, (@(x)), can have arbitrary coordinate dependence. 
This kind of order parameter describes relativistic superfluids. 


Discarding the matter fields amounts to restricting our basic setup to the manifold 
Ho/ Hiyoo) Xx M ~ Gin/ Hint X Gs.t./Hs.t.. Henceforth, I will only consider flat 
spacetimes such that Gs.t. possesses a set of D mutually commuting translation 
generators P,. With the parameterization W(x, x) = eP U(x), the MC form 
becomes 


wln, x) = —iU (m) !dU (x) + P -dx . (13.24) 


The spacetime coframe is trivial, e*4 = ô#dx”, and it is not necessary to distinguish 
frame and coordinate-basis indices. The œw and w; parts of the MC form (13.24) 
are fixed by the internal symmetry. We can therefore reuse the wealth of information 
accumulated in Chap. 7 to evaluate them. 

So far, I have not made any assumptions about the coordinate dependence of the 
order parameter beyond the requirement that all (x°(x)) vanish. Whether or not 
the order parameter breaks any spacetime symmetries has no bearing on the set of 
degrees of freedom x° of the EFT, their transformation under G, or the MC form. 
Yet, the devil is in the details, as we shall now see. 


13.2.1 New Features of the Old Setup 


A consistent low-energy EFT setup requires a number of ingredients to be in place. 
The first is of course having a set of well-defined degrees of freedom, well separated 
from whatever high-energy modes have been ignored. This is generally guaranteed 
by the broken symmetry. These low-energy degrees of freedom should map to 
positive-energy excitations of a stable ground state. Failure to satisfy this condition 
indicates that we have not identified the correct ground state. Finally, in the absence 
of further physical input, it is necessary to include in the effective Lagrangian 
all operators consistent with the given symmetry. Typically, infinitely many such 
operators exist. Hence, we need an organizing principle to decide which operators 
are relevant and which can be neglected: power counting. 


Stability of the Ground State Our choice of the reference point, Wå = (w'(0)), 
corresponds to (7“(0)) = 0, or (U(xz(0))) = e. The actual order parameter 
defines a map from M to the coset space Gint/ Hint, (U(a(x))) = Uo(x). Nontrivial 
coordinate dependence indicates that the order parameter is twisted. Importantly, not 
every choice of Uo(x) is physically consistent. A necessary condition is that Uo(x) 
is a local minimum of the energy functional of the EFT. This is straightforward to 
check in two steps. First, Uo(x) should be a stationary state, that is a solution of 
the equation of motion (EoM); see Sect. 8.3 for details on the latter. Second, the 
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spectrum of small fluctuations about this state should only contain positive-energy 
excitations. 


Example 13.4 


Recall the helimagnetic state, introduced in Example 6.5, 
(n(x, t)) = (cos kz, sin kz, 0) , (13.25) 


where k is a positive constant. In this case, the order parameter is n € 
SO(3)/SO(2) ~ S?. It is spatially twisted so that it forms a helix oriented 
along the z-axis, with pitch 27/k. In Example 9.10, I showed that this order 
parameter minimizes the energy in the EFT for ferromagnets augmented with the 
Dzyaloshinskii-Moriya (DM) interaction. The precise value of k is fixed by the 
coupling of the DM term. 


Ensuring the stability of the ground state is an integral part of the construction of 
the low-energy EFT. The requirement was already implicitly present in our analysis 
of internal symmetries in Part III of the book. The fact that the vacuum, m° = 0, 
is a solution of the EoM was guaranteed by the absence of terms linear in x° in 
the effective Lagrangian. Such terms were typically forbidden by the nonlinearly 
realized symmetry. Only in one exceptional example, namely the class of Galileon 
theories (Sect. 10.3.2), I enforced the absence of linear terms ad hoc by discarding 
the tadpole operator. Furthermore, stability under small fluctuations was ensured 
by fixing the signs of the parts of the effective Lagrangian bilinear in 1“, carrying 
respectively two temporal or two spatial derivatives. 


Power Counting Eventually, we would like to parameterize the EFT by NG fields 
T’ (x) such that (7“(x)) = 0. This makes it easy to distinguish the ground state 
from its fluctuations. It is often convenient to factorize the matrix variable U (zr) as 


U(a(x)) = Uo(x)U (T (x)). (13.26) 


This maintains a simple transformation rule for 7“ under the action of Gint. Indeed, 
a comparison with (12.19) shows that 


U (it) ËS UE G, 8i) = Up 8inU)U (Fhin, gine) . (13.27) 


In other words, the action of Gint by left multiplication is simply conjugated by 
Ug 

The challenge to face is that following blindly the machinery of Chap. 8, we 
arrive at an effective Lagrangian where derivatives act on U (zr) rather than on U (zr). 
This is not a problem if the characteristic length scale of the symmetry breaking 
Gint —> Hin is much shorter than the scale of variations of the ground state Up(x). 
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Such a hierarchical symmetry breaking occurs for instance in helimagnets. Indeed, 
the helix pitch in FeGe is about 70 nm [1], which is much longer than the scale of the 
crystal lattice at which the magnetic order is formed. In such systems, derivatives 
of both U (ñ (x)) and Uo(x) can be considered small and counted together. Should, 
however, the order parameter feature short-distance variations, a different power 
counting is needed. The precise setup of the power counting that renders the EFT 
predictive is then best considered case by case. 


13.2.2 Case Study: Relativistic Superfluids 


From the point of view of symmetry, a superfluid is an ordered phase of matter where 
an internal U(1) symmetry is spontaneously broken. We therefore need an order 
parameter charged under this symmetry. The simplest choice is to take a complex 
scalar field ¢ carrying the action of Gint ~ U(1) via (¢, x”) S (e€ġ, x). We then 
assume that due to microscopic dynamics, this field develops a nonzero VEV. That 
is however not enough. We want the vacuum to describe matter, that is a state with 
nonzero density of the U(1) charge. In a Lorentz-invariant scalar theory, a state of 
uniform nonzero charge density corresponds to a time-dependent VEV, 


($(x)) = g(x) = woe. (13.28) 


The constants gp and u are without loss of generality assumed to be positive. See 
Chap. 3 of [2] for an introduction to superfluidity from the field theory point of view. 


The same superfluid state can also be represented by a constant order param- 
eter at the cost of redefining the Lagrangian. Indeed, introduce a new scalar 
field, ®(x) = e' (x), so that our superfluid state amounts to (®(x)) = go. 
As a consequence of the field redefinition, d9¢ = (09 — iu). This agrees 
with my previous observation (see Example 8.7) that an equilibrium state 
parameterized by the chemical potential u can be described in the Lagrangian 
formalism by a constant background gauge field, A, = (w,0). It is now 
clear that the order parameter (13.28) corresponds to a superfluid at rest. 
Using Lorentz invariance, a superfluid state in uniform motion can likewise 
be parameterized by a constant timelike A„, or equivalently by the VEV 
g(x) = poe 4. 


As we already saw in Example 13.3, the condensate (13.28) satisfies our technical 
assumption underlying the construction of EFT for systems with a twisted order 
parameter. Here we have U(z(x)) = ei7()2 where w(x) is the phase of (x) 
and Q is the generator of Gint ~ U(1). The NG field transforms under the internal 
symmetry as 70 É z + e€. The MC form (13.24) reduces to w(z, x) = Qdr +P -dx. 
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Upon dropping the tadpole operator, x, which is itself quasi-invariant, the most 
general Poincaré- and U(1)-invariant effective action can be written as 


Sepp {7} = [ors Løn, O0T,...). (13.29) 


The effective Lagrangian eg is an arbitrary function of the derivatives of x in 
which all Lorentz indices have been contracted in a Lorentz-invariant manner. 

Let us check that the tentative ground state, (7(x)) = —wt, satisfies the 
EoM. This follows from the equivalence of the EoM and the conservation of the 
Noether current of the internal U(1) symmetry; see Example 4.2 for a proof of the 
equivalence. The current carried by the condensate (13.28) is necessarily constant, 
and thus automatically conserved for any values of go and pj. 

Next, we have to deal with power counting. I will follow closely the logic of 
Sect. 9.1.1, where a power-counting scheme for relativistic EFTs was worked out. A 
new complication is that we do not know a priori how large the chemical potential 
u is, and thus whether 0,7 can be considered small. We therefore cannot naively 
expand the effective Lagrangian in the derivatives of 2. However, we can use the 
fact that every x in the Lagrangian always carries at least one “persistent” derivative 
and only count additional derivatives. This makes sense: introducing the fluctuation 
field w via (13.26), that is w(x) = ñ (x)— ut, any higher derivative kills the chemical 
potential and thus captures directly the variation of 7. 

To formalize this observation, expand the effective Lagrangian as 


Llr] = X OP ir] (13.30) 


n>0 


were BAe collects all Lorentz-invariant operators with n additional derivatives. 
Now consider a generic Feynman diagram I, contributing to a given observable. 
Denote as J the number of internal propagators in I’, as E the number of external 
legs, as L the number of loops, and as V, the number of interaction vertices from 
each of a. Think of the Fourier representation, in which the diagram amounts to 
a homogeneous function of energy—momenta on the external legs. Loop integration 
contributes altogether DL powers of energy—momenta, the internal propagators 
contribute —27 powers. The persistent derivatives on each field give 27 + E 
powers of energy—momenta. Finally, the additional derivatives add up to „>o 7 Vn. 
Altogether, the naive degree of the diagram becomes E 


deg? = DL +E +9 nVn. (13.31) 


n>0 


For any given observable, E is fixed and degr > E. The leading-order (LO) 
contribution comes from diagrams with L = 0 and V, = O for any n > 1. 
Accordingly, the LO effective Lagrangian for a relativistic superfluid is LO: 
an arbitrary Lorentz-invariant function of 0,7. The next-to-leading-order (NLO) 
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Lagrangian requires two additional derivatives, n = 2. For any D > 3, the NLO 
contribution to any observable corresponds to deg T = E + 2 and amounts to tree- 
level (L = 0) diagrams with one interaction vertex from WAN and all others from 
2O, Interestingly, the contributions from loop diagrams only start to matter beyond 
NLO of the derivative expansion. Most important is that at any finite degree, only 
a finite number of Lagrangians KAA and a finite number of Feynman diagrams is 
required. This renders the power-counting scheme consistent and predictive. 

It might still look troublesome that the LO Lagrangian can be an arbitrary 
Lorentz-invariant function of ð r. This surely contains an infinite number of 
unknown parameters, so how could such an EFT be useful? To address this question, 
note there is only one way to make such Lagrangians Lorentz-invariant, namely by 
contracting the Lorentz indices on 0,7 pairwise. Then, upon separating the ground 
state from the fluctuations, the LO Lagrangian becomes 


LY lt) = P OF) P( y @ñ jj (Vit)?) , (13.32) 


where P is some as yet unknown function. In the superfluid ground state, the 
Lagrangian reduces to a function of the chemical potential, P (u). This can be 
related to the energy density of the equilibrium state by Legendre transformation, 


Hew |__, = UW) =uP'w — PW). (13.33) 
The prime indicates a derivative of P with respect to its argument. Also, I used 
that aL /A(dor) = —0-Z\) /ðu and that (r (x)) = —ut. Equation (13.33) looks 
familiar: P (u) can be interpreted as the thermodynamic pressure of the superfluid 
at zero temperature, and P’(j) = n(m) as the density of the U(1) charge. Our main 
result therefore is that the LO EFT for superfluids at zero temperature (13.32) is 
completely fixed by the thermodynamic equation of state [3]. 

To get a flavor of the physical content of the EFT, let us expand (13.32) to second 
order in the NG field, 


Lye Ut] P(w) + 2 W (ait)? — (VĒ +. (13.34) 


This shows that the NG bosons propagate with the phase velocity 


-J P' (w) -jz du _ a (13.35) 
MP" (pL) u dn dU 
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where I used the thermodynamic relations dU = udn and dP = nda. It is 
remarkable that the phase velocity of the superfluid NG boson at zero temperature 
is given by the same expression (13.35) as that of hydrodynamic sound. 


Example 13.5 


For an illustration of these general results, let us have a look at dense matter 
consisting of relativistic (Dirac) fermions in d = 3 dimensions [3]. Provided the 
interactions between the particles are sufficiently weak and their mass sufficiently 
small, the equation of state can be well approximated by that of a free gas of 
massless fermions,’ 
4 3 4 

Pw = pa rws U= az. (13.36) 
Suppose that the interaction between the fermions is attractive so that they form 
Cooper pairs in the spin-singlet, s-wave state. The fermionic quasiparticles near 
the Fermi surface then become gapped and decouple at sufficiently low energies. 
The low-energy physics of the system reduces to the dynamics of the condensate 
of scalar Cooper pairs. This is an example of a fermionic superfluid. According 
to (13.32), the low-energy EFT of such a system is dominated by a LO Lagrangian 
that is polynomial in the NG field, 


LO ir] = = [ (ao — w? vit : (13.37) 
The NG bosons propagate at the speed v = 1/./3, characteristic of a gas of weakly 
interacting ultrarelativistic particles. Note that in this case, the microscopic energy 
scale of the system is set by the chemical potential itself. The scale associated with 
the spontaneous breakdown of the U(1) symmetry is assumed to be much smaller, 
which is ensured by the weak binding of the Cooper pairs. Thus, u could not have 
been treated as a small parameter in any reasonable sense. 


13.3 Vector Modes: The Relevant, the Irrelevant 
and the Unphysical 


As demonstrated in the previous chapter, the same symmetry-breaking pattern can 
be realized with various choices of order parameter. This ambiguity is innocuous 
for internal symmetries. On the other hand, in case of spacetime symmetries, it can 
lead to nonlinear realizations with apparently different numbers of NG variables. 


3 This restriction is of course just a matter of convenience. Our general result for the LO 
EFT (13.32) applies equally well to a strongly-interacting gas of massive fermions. 
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For instance, the superfluid order parameter (13.28) breaks the combined spacetime 
and internal symmetry in a rather nontrivial manner. First, it obviously breaks 
Lorentz boosts. Second, it breaks time translations and the internal U(1) symmetry 
down to a “diagonal” subgroup, where the phase generated by a time translation is 
compensated by a U(1) rotation. In the setup of Sect. 13.2.2, all this is implicitly 
taken into account. However, as pointed out in Sect. 12.4.3, it might be meaningful 
to add a secondary, vector order parameter, representing the VEV of the Noether 
current of the U(1) symmetry. This requires introducing an additional vector of 
NG fields. Similarly, we saw in Sect. 12.4.4 that nonlinear realization of Galilei 
symmetry is easier to implement if one adds a vector of NG variables corresponding 
to Galilei boosts. 

These observations raise obvious questions. Are the additional vector NG 
variables merely a useful mathematical tool, or do they correspond to actual modes 
in the spectrum? In case they are physical, what is their exact nature? To answer 
these questions, I will now work out several physically distinct examples, from 
nonlinear realization to concrete effective actions. With the additional insight, I will 
then in Sect. 13.3.4 attempt to draw general conclusions. 


13.3.1 The Relevant: Helimagnets 


Let us start with a familiar system where a vector field turns out to excite a physical, 
gapless NG mode. This will set a benchmark for our discussion of the role of vector 
NG modes. At the same time, it will provide the first example of an EFT going 
beyond the class of systems covered by Sect. 13.2. I already addressed spin systems 
in detail in Chap. 9. Moreover, I mentioned the exotic case of helimagnets on several 
occasions. I will therefore not spend time on reviewing the basic phenomenology, 
and instead start right away with a symmetry-based analysis. 

We are talking about a condensed-matter system whose symmetry is captured 
by the Aristotelian group in three spatial dimensions, G ~ SO(3) x R4. Unlike in 
Chap. 9, I will not include a separate internal symmetry under spin rotations. This 
is broken by the spin-orbit coupling, which plays a key role in the formation of the 
helimagnetic order. The order parameter is the local magnetization density, which 
is a vector under spatial rotations, hence M ~ R?. We are of course interested in 
the physics of ordered states where the magnetization is nonzero. This fixes the two 
isotropy groups Hp ~ SO(3) and Hiyo,0) ~ SO(2). The local manifold decomposi- 
tion (13.5) in this case becomes 


G ~ SOQ) x S? x Rt. (13.38) 


The submanifold relevant for the nonlinear realization of the symmetry on the NG 
variables is Ho/ Heyo,0) x M = S 2 x R4, I will parameterize the spacetime using the 
translation operator Ty , = e” e* P. The coset space S* is parameterized by two 
local coordinates 2“, encoded in an SO(3)-valued matrix U (zr). 
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Having fixed the basic setup, we can read off the MC form from (13.3), 


a(r, x) = —iU (x) !dU (r) + Hdt + U(r)! (P - dx)U (r) 
(13.39) 


—iU (1) 'dU (x) + Hdt + P, [U (r) "FF, dx° . 


The second equality follows from the fact that the momentum operator carries 
a vector representation of SO(3) under the adjoint action thereof.* Using (13.9), 
we subsequently identify the elements of the spacetime coframe, 


eLa, x)= dt, — eE, x) = [Um E, dx" . (13.40) 


The w part of the MC form comes entirely from —iU (x)~'dU (x), i.e. is given by 
wo" (1) = wf (r)dz”. The related covariant derivatives of the NG variables z“ are 
extracted using (13.15), 


Vor’ = wf (1)dor” , Vrn’ = wy (a)U (r) ðs” . (13.41) 
Under the action of Hop,0) ~ SO(2), these split into the irreducible multiplets 


b2Var" , eP $40 anp , 
Z 7 (13.42) 
Vor" , Var" , ô Vanp + 55 Vala — SapdeVeI~ . 

Those on the first line of (13.42) are singlets, while the first two items on the 
second line transform as vectors under SO(2). Finally, the last operator in (13.42) 
transforms as a traceless symmetric tensor of SO(2). That is also a two-dimensional 
real representation, but carries a double charge of SO(2) compared to the vector 
representation. This is important when combining the irreducible multiplets into an 

effective Lagrangian in a way that preserves invariance under H(yg,0) ~ SO(2). 
Before we can construct an effective action, we still need to consider power 
counting. Here I will, unlike for superfluids in Sect. 13.2.2, assume that the spatial 
variation of the order parameter is slow. This allows us to apply the naive scheme 
where each (covariant) derivative counts. The EFT will then be dominated by 
operators with the lowest number of derivatives. It is convenient to express the 
effective Lagrangian in terms of a unit-vector parameterization of the coset space, 
n(x) € S°. This is related to U(x) via n(1) = U(st)no, where no € S? is an 
arbitrary but fixed reference vector. The advantage of using the n variable is that it 
makes invariance under the whole Ho ~ SO(3) manifest. Dropping surface terms, 


4 In Chap. 9, I worked out the EFT for spin systems using the fundamental (spinor) representation 
of SO(3). It goes without saying that this is just a matter of convenience. The nonlinear realization 
of symmetry on coset spaces as detailed in Sect. 7.3 only depends on the structure of the symmetry 
group. In the present case, the use of the vector representation of Hy ~ SO(3) is suggested by the 
transformation properties of the momentum operator P. 
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the most general invariant Lagrangian with up two derivatives consists of 


go” = cn V xn), 


e 


Lo” = 5°50 Ign + POV - n)? + On. (V x n) 


e 


+c (V -mfn (V x n)], (13.43) 


LEP = cP don - [(n - V)n] , 


e 


0,2 0,2 
LE = cP (Gon)? . 


All the operators in (13.43) are clearly invariant under spatial rotations. 
What is less obvious is that there are no other algebraically independent 
contributions to the invariant Lagrangian with up to two derivatives. This 
follows from the formalism based on the MC form, but would not be 
straightforward to check in the language using the n field. Let me at least 
clarify why some apparent candidate operators do not give anything new. 
Starting with Eads there is no operator of the type (V x), since this equals 
6'°d,n-dsn—(V-n)* up to a surface term. Likewise, the operator [n x (V xn)]? 
is absent, being equal to (V x n)? —[n-(V x n)]?. Other candidate operators 
can be eliminated using the identity (n - V)n = —n x (V x n), valid for a unit 
vector field n. Finally, there is no dp” - (V x n) in gA; D since this is itself a 
surface term. Similar reasoning shows that no new operators can be produced 
using second covariant derivatives such as ôg Vo Vaz“ or ôg Va Van". 


The presence of a with a single spatial derivative is a new feature, arising 
from the vector order parameter. There is no invariant operator with a single time 
derivative. However, we know from Chap. 9 that an internal SU(2) spin symmetry 
together with spatial rotational invariance admit a quasi-invariant Lagrangian, 


b 
gO) (0.1) Eabrt don 
eff 1 1473 


: (13.44) 
where a,b e {1,2}. Being free of spatial derivatives, this is also quasi-invariant 
under our Hp ~ SO(3) that acts on both n and spatial coordinates. Finally, note that 
the coframe (13.40) yields the standard volume measure d?r dt thanks to the fact that 
U (zr) is an SO(3)-valued matrix. This justifies a posteriori dropping surface terms 
from (13.43), and makes it actually possible to include a quasi-invariant Lagrangian 
such as (13.44). 

In ferromagnets where the coupling c0 D ig nonzero, every time derivative 
counts as two spatial derivatives (see Sect. 9.2.1 for a detailed justification). We 


can then discard the ao and oo Lagrangians as subleading. Even with this 
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simplification, there are still six independent operators in the dae a and 
BA Lagrangians. In order to make the analysis of the physical consequences 
of the EFT feasible, we need to take one more step to reduce the Lagrangian. 
Namely, the spin-orbit interaction in real materials is typically much weaker than 
other interactions. It therefore makes sense to treat the couplings of all the operators 
that lock the spin and orbital rotations together, namely cf - aa a and cf D, 
as small. Counting all of these formally as degree one in the derivative expansion, 
the entire LO (degree-two) effective Lagrangian will be just 


abn" don” a 6’ ən : Osn = anas 


Lp = -M 
a 1+n3 2 ADM 


n- (V xn). (13.45) 


Here I have already switched to the physical notation for the effective couplings: M 
for the local magnetization density, @, for the spin stiffness, and Apm for the length 
scale of the DM interaction. The latter is all that is left of the effects of the spin-orbit 
coupling at the LO of the derivative expansion. 

I showed already in Example 9.10 that the DM interaction inevitably leads to 
the formation of a helical structure in the ground state. One can without loss of 
generality choose a Cartesian system of coordinates such that 


(n(x, t)) = no(x) = (cos kz, sin kz, 0) , (13.46) 


where k = 2x /àpm. Let us next focus on the excitation spectrum. The first step in 
that regard is to reparameterize the field following the logic of (13.26), 


cos kz — sin kz 0 
n(x,t) =| sinkz coskz 0} N(x,t). (13.47) 
0 0 1 


The variable N (x, t) is still a unit vector. Moreover, in the ground state it is constant, 
(N(x,t)) = No = (1,0,0). This allows us to identify independent fluctuations 
around the ground state with the N? and N? components of N. 

The task to express the Lagrangian (13.45) in terms of N is a bit tedious, and the 
final result is not elegant. I will therefore skip the straightforward details and only 
spell out the part of the Lagrangian bilinear in N*, N°. Up to a surface term, 


LO . Qs y? A(z) N? 
L S a N?) ee we 3 (e) OEE (13.48) 


where 


M An . 
A(z) = —— 09 + —— [(cos kz)ðı + (sinkz)d2] . (13.49) 
Os ADM 


S 
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The excitation spectrum is determined by the zero modes of the matrix differential 
operator in (13.48). Due to the explicit coordinate dependence, this is a complicated 
problem. I will therefore content myself with some simple observations that 
illustrate the peculiarities of helimagnets. To start with, the presence of operators 
with a single time derivative makes the two degrees of freedom N?, N? canonically 
conjugate to each other. The spectrum should therefore consist of a single type of 
magnon, just like in uniform ferromagnets in the absence of the DM interaction. 
The magnon dispersion relation can be calculated explicitly in the special case 
of propagation along the helix axis. The single-particle wave function is then 
independent of the x, y coordinates so that A —> —(M/Qs)do9. A Fourier transform 
in time and the z-coordinate then gives the energy as a function of momentum, 


E(p:) = s Ipzl4 pe +k? . (13.50) 


The physical momentum p; should be much smaller than the inverse of the length 
scale associated with the formation of the ferromagnetic order, which can be thought 
of as ./Qs. This defines the range of validity of the EFT. Within this range, however, 
there are two qualitatively different regimes. At long wavelengths, p; < k, (13.50) 
is well approximated by E(p,) ~ (kos/M) |pz|. The dispersion relation is linear. 
The corresponding one-particle state is a linearly polarized spin wave with N? ~ 0. 
This amounts to spin oscillations in the xy plane along the direction perpendicular 
locally to no(x). On the other hand, at intermediate wavelengths, k << pz K ,/@s, 
the dispersion relation (13.50) turns into E(p,) ~ (@s/M) pe This reproduces the 
standard ferromagnetic magnon spectrum we found in Sect. 9.2.2. Physically, this 
means that such short-distance oscillations of spin are insensitive to the helical 
structure of the ground state. The magnon is circularly polarized in the plane 
perpendicular to the local magnetization no (x). 

This concludes our excursion to helimagnetism. We have discovered very 
interesting physics, but no surprises as to the number and type of NG modes in the 
spectrum so far. To that end, we will capitalize on the experience we have collected 
here to explore another fascinating physical system: liquid crystals. 


13.3.2 The Irrelevant: Smectic Liquid Crystals 


Liquid crystals are an intriguing exotic state of matter; see [4] for a thorough 
account of the subject. The basic ingredient underlying the liquid crystal order is a 
strongly anisotropic microscopic constituent. This is typically an elongated organic 
molecule or a polymer, with a rigid, rod-like structure. An ordered ground state 
arises from a geometric arrangement of the molecules that breaks invariance under 
spatial translations or rotations, or both. 

I will be mostly concerned with the interplay of two simple liquid crystal phases: 
the nematic phase and the smectic (or more precisely smectic-A) phase. In the 
nematic phase, the microscopic constituents are aligned along a common direction 
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Fig. 13.1 Schematic ) >a u Wwe YA 
visualization of nematic and ) f ) | l ( | ( ( ( T | 
smectic-A orders. A nematic ) ) I i -A Aaaah aha 
consists of a set of mutually li i ) ( | i ) ( Í ) 
aligned undirected rods with ) ( | ( ) ah an A Ahn A 
random positions. In a i | ) : (| ) ( ( i i 
smectic-A, the rods are in ‘ l i ( i is ses an a 
addition organized in layers. l ( l ( | i ( ‘ i l | ) Í 
Their positions within each 8B oo nnr t -E 
layer remain disordered nematic smectic-A 


but their spatial positions remain disordered as in an ordinary liquid. See the left 
panel of Fig. 13.1 for a visualization. In the smectic phase, the alignment persists 
but the constituents are in addition organized in parallel layers. Within each layer, 
however, their positions remain disordered; see the right panel of Fig. 13.1. Both of 
these phases feature an order parameter that defines the axis of alignment, and can 
be viewed as an “unoriented vector.” To describe smectics, one needs in addition a 
secondary order parameter that accounts for the layered structure of the medium. 
The microscopic symmetry governing the physics of liquid crystals is the same 
as that of spin systems with spin-orbit coupling, G ~ SO(3) x R4. In the simpler 
nematic phase, which I will start with, the target space is M ~ R?/Z2, where 
any vector in R? is identified with its opposite. This setup is almost identical 
to that of Sect. 13.3.1, and we can therefore largely reuse the EFT developed 
therein. The change in the global structure of M from R? to R*/Z> affects the 
nature of topological defects in the medium. However, as far as the dynamics 
of small fluctuations of the order parameter is concerned, all we have to do is 
impose invariance under the reflection n — —n. This kills, first of all, the quasi- 
invariant Lagrangian (13.44), for there is no equivalent of “magnetization” in liquid 
crystals that the coupling ch D could measure. Moreover, the oa and ch D terms 
in (13.43) drop out, being odd in n. In materials that respect spatial parity, we can 
finally discard the gao term.° All in all, the LO Lagrangian for the fluctuations 
; : : (2,0) (2,0) (2,0) (0,2) 
of the order parameter in nematics consists of the Cp, C3 C3 and Cy 
operators. However, a somewhat different basis of operators with spatial derivatives 
is more common in the literature, namely 


1 
a > = [kiv -n) + Koln - (V x n)? + K3[n x (V x n)P| 
(13.51) 
The advantage of this parameterization is that there are field configurations for 


which all but one of the operators in (13.51) vanish. Global stability of the 
uniform state with constant (n(x)) = mo thus requires that all the couplings 


5 The or term is relevant for cholesteric liquid crystals whose building blocks possess intrinsic 


chirality. The mathematics of cholesterics is largely identical to that of helimagnets [5]. 
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Kı,2,3 are positive. For completeness, let me add that they are related to the 
parameters in (13.43) by Ky = —2(c 4c), Ky = —2¢€ 4 2) and 
K3 = —2c," ) up to integration by parts. 

Unlike in the case of helimagnets, I will not use the Lagrangian (13.51) to analyze 
the spectrum of excitations of a nematic. Namely, to get a physically correct picture 
of the dynamics, one would need to consider the interplay of the vector variable 
n with hydrodynamic degrees of freedom (see Sect. 8.5 of [6]). After all, we are 
talking about liquids. Our restricted setting will however be sufficient to contrast 
the NG degrees of freedom in the nematic and smectic-A phases. 

To describe the smectic state, we need an additional order parameter accounting 
for the layered structure as shown in the right panel of Fig. 13.1. This can be 
done with a single real scalar field (x). The layers of the medium are then 
defined as surfaces of constant ¢@. The translation across one layer of molecules 
changes @(x) by a fixed amount; I will use a normalization of (x) where the 
increment equals 27. Interestingly, this picture implies the existence of a new, 
emergent symmetry. Namely, for the sake of identification of the individual strata 
of the smectic, it does not matter what the attached value of @ is. None of the 
surfaces of constant @ changes if we shift (x) by a constant. The EFT for 
smectic liquid crystals should therefore be invariant under the internal symmetry 
(n, ġ, x") 5 (n, +€, x”). This is our first example of an emergent symmetry 
arising from the spatial distribution of a medium; I will give a more systematic 
account of such symmetries in Chap. 14. The complete symmetry group of a 
smectic is then G ~ [SO(3) x Rf] x R. The corresponding isotropy groups are 
Ho ~ SO(3) x R and Hiyo,0) ~ SO(2). The degrees of freedom of the EFT are 
determined by the coset space Ho/ Hiyo,0) ~ S? x R. 

The values of the two order parameters in the smectic ground state are correlated. 
As indicated in Fig. 13.1, the axis of alignment nọ of the molecules is perpendicular 
to the entire stack of parallel layers. Moreover, the individual molecular layers 
are equidistantly spaced. This implies that the VEV of the gradient (V@(x)) is a 
constant nonzero vector, parallel to no. I will use the notation 


(P(x, t)) = kno: x , (13.52) 


where the parameter k defines the distance between neighboring layers, 27 / k. 

The shift symmetry R guarantees that (x) can only enter the effective 
Lagrangian with derivatives. Being a scalar under rotations, its covariant derivatives 
are 


Vo = oð , Vib = U(x)*,.asb , (13.53) 


where U (zr) is still the matrix related to n by n(x) = U(z)no. Also, I used the 
previously derived coframe (13.40), which is not affected by adding the scalar field 
(x). The appearance of these covariant derivatives in the effective Lagrangian is 
constrained by the isotropy group H(yo,0) ~ SO(2) and the reflection n —> —n. 
To these we should add the discrete symmetry under ¢ — —@ which, just like 
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the internal shifts of ¢, preserves the identification of the layers of the smectic. 
With these constraints, the contributions of ¢ (x) to the LO effective Lagrangian are 
reduced to 


Ly" D co(dop)* + cyn- Vb)” + cL (a x VO)’ . (13.54) 


In order for the alignment of Vø and n to be energetically preferred, the spatial 
gradient couplings cy and c} should satisfy c} < cy). 

Let us now turn the argument around and see how the background (13.52) affects 
the vector field n. I will set without loss of generality no = (0, 0, 1) so that the two 
independent fluctuations of n correspond to n! and n?. This reduces (13.54) to 

LE > (ej — ck [n + n]. (13.55) 


Thanks to the stability constraint on the couplings, c} — c1 > 0. What we have here 
is therefore a well-defined, positive-definite mass term for the fluctuations of n. 


Let us pause to appreciate the implications of this result. In the nematic 
phase, we dealt with two NG degrees of freedom, parameterizing via the 
unit vector n the coset space S*. In the smectic phase, we extended the 
coset space to S? x R. The corresponding new NG variable parameterizes 
the fluctuations of @ around its VEV (13.52). Given that ø only enters the 
effective Lagrangian with derivatives, it necessarily couples to a NG mode in 
the spectrum. However, at the same time, the fluctuations of n received a gap. 
Adding an extra NG field has reduced the number of NG modes from two to 
one! 

To understand what is going on, think of the order parameter (13.52) alone. 
Being a spatial vector, the gradient V@ spontaneously breaks spatial rotations 
in addition to the internal shift symmetry of ¢ and translations in the direction 
of no. We may then treat (n(x)) = no as a secondary order parameter, since 
it does not break any symmetries that would be left intact by (¢(x)). This 
makes the n mode redundant: its presence is not required to reproduce the 
symmetry-breaking pattern of a smectic. Consequently, the fluctuations of n 
are not protected by symmetry from acquiring a gap. 


This is an explicit realization of an issue I alerted the reader to already in 
Chap. 12. The number of true, gapless NG degrees of freedom (one) is fixed 
by the symmetry-breaking pattern. However, the additional, would-be NG modes 
contained in n(x) are physical and may manifest themselves as low-lying excitations 
of the smectic ground state. The nematic and smectic phases of liquid crystals are 
separated by a continuous (second-order) phase transition. At the transition point, 
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the scale k vanishes. Accordingly, near the phase transition, the gap of the n-type 
fluctuations is very small and they need to be included in the low-energy EFT. 

Further away from the phase transition, we expect the gapped modes to have little 
effect on the low-energy physics. In this regime, it should be possible to construct 
an EFT for smectics in terms of ¢(x) alone. This can be arrived at in two different 
ways. One possibility is to start from a combination of (13.51) and (13.54) and 
integrate out the vector n. The other possibility is to construct the EFT for ¢(x) 
alone from scratch. This is arguably more straightforward but less predictive, since 
the effective couplings of the EFT will be unrelated to those in (13.51) and (13.54). 
For illustration, I will nevertheless choose the latter option as it highlights the 
physics of the NG mode in the smectic phase. 

The task to find an EFT for a scalar field @, subject to an internal shift 
symmetry R, copies closely the construction of Sect. 13.2.2, with two differences. 
The first difference is minor. Namely, unlike in Sect. 13.2.2, we do not impose full 
Poincaré invariance, but merely the Aristotelian symmetry. The resulting effective 
Lagrangian is, in analogy with (13.29), a function of derivatives of (x) where 
spatial indices are contracted in a way preserving invariance under spatial rotations. 
The second difference is absolutely essential: we want our EFT to stabilize the 
anisotropic smectic state (13.52). To that end, let us parameterize the fluctuations 
of d(x) by a NG field r(x) such that (x,t) = kno - x + m(x,t). The basic 
rotationally invariant building block for the construction of the Lagrangian is 
(Vo)? = k? + 2kno - Vx + (V2)*. Dropping the constant piece, the part of the 
effective Lagrangian with one derivative per field will be a generic function of doz 
and kno - Va + (Vx)? /2. The static part of the effective Lagrangian, bilinear in the 
NG field z (x), is thus contained in 


(Vr)? 
2 


(Vr)2T 
5 | +.. (13.56) 


Zer > cı (kno: Vx + |+ e [tao va + 


The ellipsis includes operators that are bilinear in x(x) but contain more than one 
derivative per field. The leading operator of this kind is c3 (V7)? = 3 (Vr). 
Remarkably, the cı operator is forbidden. The easiest way to see this is to think 
of the gradient V@ as a vector field, A = Vø. The part of the static Lagrangian 
with one derivative per ¢@ is then a mere function of A. Should the corresponding 
Hamiltonian density have a minimum for nonzero (A) = kno as implied by (13.52), 
its Hessian matrix at the minimum must have two zero modes. This is just the 
statement of the Goldstone theorem in terms of the eigenvalues of the mass matrix, 
which we proved back in Sect. 3.1. Hence, for the smectic state (13.52) to be stable 
within the EFT, the Lagrangian (13.56) must not contain any terms quadratic in the 
part of the gradient of x, perpendicular to mo. It is the stability criterion together 
with the spontaneously broken rotational invariance that forbids the cı operator. 
We conclude that the leading contributions to the bilinear Lagrangian consist 
of the operators (do)?, (Vir)? and (Vir), where || and L denote projections 
to directions parallel and transverse to no. This suggests an unusual, anisotropic 
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power counting. For all these operators to contribute at the same order, the time 
derivative should have the same counting degree as V and Vi. The presence of a 
higher power of the transverse gradient has a profound impact on the stability of the 
smectic phase at nonzero temperature. I will relegate a more detailed discussion and 
further generalization of this observation to Chap. 15. 


13.3.3 The Unphysical: Nonrelativistic Superfluids 


The final example in our exploration of the role of vector modes will be nonrela- 
tivistic superfluids. Here we do know a priori that it is possible to construct an EFT 
solely in terms of a scalar degree of freedom. All one has to do is to take the EFT 
for a relativistic superfluid, developed in Sect. 13.2.2, and perform a nonrelativistic 
limit. It is however instructive to carry out the construction of the nonrelativistic 
version of the EFT from scratch. 

To get started, recall the basic properties of Galilei symmetry from Sect. 12.4.4. 
The Lie algebra of the Galilei group in d spatial dimensions includes the generators 
P, and H of space and time translations and J,s of spatial rotations. In addition, 
there is a vector generator K, of Galilei boosts. I will add from the outset a central 
charge Q whose eigenvalues measure nonrelativistic mass. This turns the Galilei 
group into the Bargmann group. Altogether, the nontrivial part of the local structure 
of the symmetry group is determined by the commutation relations 


[H,Kr]=iP,, [Pr Ks] = iôrsQ . (13.57) 


The commutators of J;s are fixed by rotation invariance and will not be needed. All 
the other commutators not listed here are vanishing. Mathematically, the Bargmann 
group has the structure G ~ SO(d) x {Re x [R? x U(1)g]}. All the symmetry 
transformations but translations leave the spacetime origin fixed, hence its isotropy 
group is Ho œ [SO(d) x R2] x U(1)ọ. The translations themselves will be 
represented by the operator Ty, = e” eP, 

In order to describe the superfluid order, we need an order parameter that carries a 
nontrivial action of the internal U(1)g subgroup. Similarly to relativistic superfluids, 
the simplest choice is a complex scalar field y. For any wo ¥ 0, the corresponding 
isotropy group is then Hiy,.0) ~ SO(d) x Ri. However, as pointed out before, this 
is not satisfactory, since it violates our requirements on the decomposition (13.6) of 
the Lie algebra g of G. Namely, the operators P, and H span a basis of g/þo but 
do not carry a representation of H(y,),0). The way out is to add a secondary order 
parameter A“ = (A®, A) that transforms as a vector under Galilei boosts. Choosing 
Ab = (a, 0) with a ¥ 0 can be interpreted as specifying the density of the U(1)g 
charge in the rest frame of the superfluid. It reduces the isotropy subgroup further 
to Heyo, A0),0) ~ SO(d). The coset space Ho/ Ai, Av),0) S RZ x U()@ can be 
parameterized by NG variables x and é” through U(r, £) = e!™2e**. In this 
parameterization, a Galilei boost with velocity v acts on the NG variables and the 
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spacetime coordinates as 
King x,t) = (m +v- x+ vt/2,E +v, x +t, t). (13.58) 
With all the pieces at hand, we can now compute the MC form (13.3), 


olr, E, x, t) = Oda — £ - dx + (1/2)£?dt] + K - dë + Hdt + P - (dx — Edt) . 
(13.59) 


The wy part of the MC form is trivial. From wp = Hdt + P - (dx — dt), we extract 
the spacetime coframe, e”? = dt and e"! = 5£(dx* — &°dr). The 6£é*dr term in e*£ 
does not affect the spacetime volume form, we can thus use the volume measure 
dx dt to turn effective Lagrangians into effective actions. To construct covariant 
derivatives of the NG fields, we will also need the spacetime frame, dual to the 
above coframe, 


gl, e=", 220, wed, (13.60) 


The covariant derivatives of 2 and é” are then extracted from the w; part of the MC 
form (13.59) with the help of (13.15), 


Vor = don +&- Va —&7/2, Vem = ôi (Os — &s) , 


(13.61) 
Vog” = (8 +£- VDE", VE = 8M aye" . 
It is an easy exercise to check explicitly that all these covariant derivatives are 
invariant under the Galilei boost (13.58). Thanks to their simple form, it is not 
necessary to distinguish frame and coordinate-basis indices. In the following, I will 
therefore happily use V,m = 0,2 — &, and Vs&" = sé". 


In accord with the general machinery of nonlinear realizations, we could also 
add matter fields, organized in linear multiplets of Ho, A0),0) ~ SO(d). 
By (13.17), the covariant derivatives of a matter field x? would then be 
Vox® = (00 +&-V)x® and V,x° = ə x£. This opens the possibility to 
promote (almost) any nonrelativistic theory with Aristotelian symmetry to 
a theory that is Galilei-invariant. Indeed, recall that in Part III of the book, 
we dealt largely with Aristotelian EFTs where we imposed invariance under 
linearly realized rotations by hand. What prevents us from introducing an 
auxiliary vector field £” and replacing 09 —> Vo = ðo +é- V everywhere? This 
will certainly work for strictly invariant Lagrangians. (The naive replacement 
d9 — Vo might spoil the quasi-invariance of Lagrangians that shift upon 
a symmetry transformation by a total time derivative.) The only fly in the 


(continued) 


13.3. Vector Modes: The Relevant, the Irrelevant and the Unphysical 309 


ointment is that we do not have a physical interpretation for the £” field 
in terms of the NG degrees of freedom of the Aristotelian EFT. Rather, £” 
should be viewed as the local velocity of a medium carrying the NG modes; 
the combination ðo + & - V is often referred to as the material derivative. 
The Aristotelian EFTs constructed in Chap. 8 describe the low-energy physics 
in the rest frame of the medium. The same remark applies to relativistic 
systems. There, adding a NG field for Lorentz boosts allows one to promote 
an Aristotelian theory to one invariant under the full Poincaré group. 


With the basic building blocks at hand, we can proceed to the construction of an 
EFT for the superfluid. Following closely the relativistic counterpart in Sect. 13.2.2, 
we first fix a power-counting scheme. We again count both doz and Vz as degree 
zero. For the covariant derivatives in (13.61) to have a consistent counting degree, 
we have to assign £" degree zero as well. Any other derivative acting on m or é” 
will then have degree one. It follows that at the LO of the derivative expansion, 
we can ignore V”. The LO Lagrangian will be some function of the rotationally 
invariant operators Voz and 6’*V,7V,7. This underlines the fact that the field £” 
is merely auxiliary and does not represent independent degrees of freedom. Indeed, 
it only enters the LO Lagrangian without any derivatives and can be eliminated 
algebraically by imposing its EoM, 


(13.62) 


(Ore ee) es (5"V,1V, 2) 


ot | 
The natural solution is € = Vz, which turns the two building blocks for the LO 
Lagrangian to Vow — dow + (Vx)? /2 and V,m — 0. Of course, (13.62) can also 
be satisfied if the expression in square brackets vanishes. This would imply that the 
LO Lagrangian only depends on Vor + (1/2)8"° V, Vyr = ðor + (Vx)? /2. Either 
way, we end up with the same LO Lagrangian in terms of z alone. 

Finally, recall the superfluid ground state can be described by a time-dependent 
VEV of a complex scalar field. This amounts to (7(x)) = —ut where ju is the 
chemical potential. The fluctuations around this background can be parameterized 
by a field 7 such that w(x) = a(x) — pt. Following again the analogy with 
Sect. 13.2.2, we conclude that the LO effective Lagrangian for Galilei-invariant 
superfluids reads 


L lz] = P (u — dot — (Vit)?/2) (13.63) 
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The function P(j) represents the thermodynamic pressure of the superfluid in 
equilibrium at zero temperature. 


As a simple application of our new EFT, expand the Lagrangian (13.63) to second 
order in the NG field 7. Up to a surface term, we find 


P" (u) (az)? — P' (u) 


Lye [7] P(u) + 5 ea Ne oes l (13.64) 


This gives immediately the phase velocity of the NG boson, 


P PE d (13.65) 
P(w) Vda’ l 


where n(u) = P'(u) should be interpreted as the mass density of the superfluid. 
This matches the relativistic result v = ./dP/dU in (13.35), since in the 
nonrelativistic limit, the energy density U (jz) reduces to the density of the rest mass. 
Note that in spite of the appearance, the EFT (13.63) describes a type-A NG mode 
with a linear dispersion relation. This justifies a posteriori the use of the power- 
counting scheme where temporal and spatial derivatives are counted equally. 


Before closing the discussion of nonrelativistic superfluids, I have a debt to pay 
off. Namely, when constructing the EFT, I tacitly assumed the effective Lagrangian 
to be strictly invariant. Could the Lagrangian also include some quasi-invariant 
contributions? The answer to this question is surprisingly rich, but luckily will not 
affect the LO Lagrangian (13.63). 

As pointed out in Sect.8.1, quasi-invariant Lagrangians for broken internal 
symmetries are classified by relative Lie algebra cohomology. It is not clear 
whether this elegant result survives all the complications of nonlinear realization 
of spacetime symmetries. Let us nevertheless take a leap of faith and see where it 
will lead us. We thus seek (D + 1)-forms that are closed and Ho-invariant. I will 
first split the MC form (13.59) into components corresponding to the individual 
generators, 


wg = dr —&-dx+(1/2)édt, wy = dé’, 


on = dt , oh = dx” — "dt . 


(13.66) 


We are now to take the exterior product of D + 1 of these 1-forms. Invariance 
under the whole isotropy group Hp is ensured by contracting vector indices in a 
rotationally invariant manner. It is the closedness requirement that poses a challenge. 
To that end, we take note of the MC equations, reflecting the commutation 
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relations (13.57) between the generators, 
dog = swp hoy, dæk =0, dwy =0, dwp=ayAwk. (13.67) 
These suggest a set of d + 1 naturally closed, rotationally invariant (D + 1)-forms, 


(n) 1 ri r Intl 
QD EryrgOH Awg AWK A AOR Awp 


Sy Av Awp (13.68) 


1 
= —€y,...,dt A (da — &- dx) A dE" A- A dE™ A dx’! A+. A dx" , 


d! 
where n = O,...,d. The corresponding effective Lagrangians are equivalent to 
the D-form potentials of these (D + 1)-forms, that is such Q Q that day = Cis 


These D-form potentials are obviously not unique, since they can be shifted without 
consequences by the exterior derivative of any (D — 1)-form. A suitable choice is 


1 
co = ayer radi A [rae A+++ AdE™ A dx™t! A +--+ A dx” (13.69) 


n 2 
ET A... A dE™! A dx™ A--- A dx" l. 
taar a £ x r 


A reader wishing to check this may find the following identity useful, 


Erer dE? A dE™ A+++ AdE™! Adx™ A+++ A dx 


d—n+1 
2 -TE erdi" AdE A o A dE™ A dx™t A + A dx, 


(13.70) 


Next, we eliminate the auxiliary field £” by imposing the same constraint as above, 
&€ = Vz. After some manipulation, the D-form potentials (13.69) then give a set of 
d + 1 quasi-invariant Lagrangians (with adjusted overall normalization), 


n 
ZY = aT ka [[@, Os, 70) . (13.71) 


quasi 
i=l 


The form of the Lagrangians (13.71) should ring a bell. Indeed, except for 
replacing spacetime indices with spatial ones, these are exactly the Galileon 
Lagrangians (10.54). The similarity is not accidental. The generators Q, K„ of 
the constant and linear parts of the Galileon symmetry (10.53) and the energy- 
momentum operator P,, satisfy the commutation relation [P,,, Ky] = iguyQ. This 
is identical to the centrally extended commutator in the Bargmann algebra. We 
have thus killed two birds with one stone, explaining the origin of the Galileon 
Lagrangians as a byproduct of our discussion of Galilei-invariant superfluids. Now 
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back to the physical content of (13.71). The n = 0 Lagrangian, is is a tadpole 
operator. This must be dropped to keep the EFT stable. The n = 1 Lagrangian 
is equivalent to (Vz)? by partial integration. This is, in spite of appearance, 
already included in the LO effective Lagrangian (13.63). Namely, it is a part of 
doa + (Wsr)?/2 where doz itself is a surface term. Finally, all the quasi-invariant 
Lagrangians (13.71) with n > 2 contain more than one derivative per field. 
Consequently, they will only contribute at higher, subleading orders of the derivative 


expansion of the superfluid EFT. 


13.3.4 Inverse Higgs Constraints 


The analysis of concrete examples has given us enough experience to appreciate the 
broader spectrum of subtleties associated with spontaneous breaking of spacetime 
symmetries. Let us reevaluate what we have learned in particular about the role of 
vector fields. I should warn the reader that this matter is not yet quite settled. Hence, 
the content and organization of the discussion below is necessarily more subjective 
than elsewhere in the book. 

In case of internal symmetries, the situation is clear. There is a one-to-one 
mapping between broken symmetry generators and NG fields in the low-energy 
EFT. The construction of the EFT itself is fairly streamlined. All one needs to know 
is the symmetry-breaking pattern. As shown in Chap. 8, finding the EFT then boils 
down to solving certain linear-algebraic constraints for the tensor couplings in the 
effective Lagrangian. The functional dependence of the Lagrangian on the NG fields 
is completely fixed by symmetry. 

In case of spontaneously broken spacetime symmetry, we would hope to be able 
to do the same. This desire lies behind the agnostic nonlinear realization that I 
already mentioned in Sect. 12.3.2. In this framework, one starts with a separate 
NG field for each spontaneously broken symmetry generator. The actions of all 
the generators in an abstract group space are by construction independent of each 
other. This however leads to the surprising observation that the number of fields 
that ultimately couple to gapless excitations may be lower than that of the broken 
generators.° I will now outline the various mechanisms how the naively expected 
number of NG fields based on mere counting broken generators may be reduced. 
This will add nuance to the intuitive picture painted in Sect. 6.1 and summarized in 
Fig. 6.1 therein. The different mechanisms are listed in Fig. 13.2. 


Redundancy of Symmetries Two symmetries that are locally equivalent neces- 
sarily lead to identical fluctuations of the order parameter. This is the simplest 


6 Remember that these independent NG fields may still not be in a one-to-one correspondence with 
NG modes in the spectrum. Some of the fields may be pairwise canonically conjugated, leading to 
type-B NG bosons. The canonical conjugation mechanism is the same for internal and spacetime 
symmetries, and thus requires no further discussion. 
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Fig. 13.2 Various broken 
mechanisms whereby the generators 
number of physical NG fields 

coupling to gapless, true NG 
modes can be lower than the 
number of generators 
parameterizing the coset 
space Ho/ Hyo,o). The 
detailed meaning of the 
keywords involved is 
explained in the main text. 
The physical NG fields are in 
turn in a one-to-one (type-A) 
or two-to-one (type-B) 
correspondence with the NG 
modes in the spectrum 


redundant 
symmetries 


redundant order 
parameter fluctuations 


secondary order 
parameter 


required 
gapped modes 


physical 
NG fields 


and most important of all the mechanisms, to which I already devoted the whole 
Chap. 11. Its advantage is that such a redundancy can be checked at the level 
of classical symmetry transformations. No specific choice of an order parameter 
or knowledge of the corresponding symmetry-breaking pattern is needed. See 
Sect. 11.3 for examples. 


Redundancy of Order Parameter Fluctuations Moving down the decision tree, 
the next possibility is that some symmetries are locally distinct, yet lead to 
equivalent fluctuations of the order parameter. An example of this type is the 
helimagnet; see Example 6.5 for details. Here spatial translations and rotations are 
not trivially redundant due to the vector nature of the order parameter. However, 
thanks to the particular spatial modulation of the order parameter, three independent 
broken symmetry generators lead to mere two distinct types of fluctuations. This 
type of redundancy requires detailed knowledge of the order parameter, but can still 
be detected straightforwardly following the approach of Sect. 6.3.1. 

Both of the above types of redundancy are automatically accounted for by the 
standard nonlinear realization of Chap. 12. This ensures a one-to-one parameter- 
ization of whatever fields carrying the action of the symmetry one starts with. 
However, it is sometimes advantageous to add auxiliary would-be NG fields, even 
if these are known beforehand not to correspond to separate physical degrees of 
freedom. We saw an example in Sect. 13.3.3. The unphysical nature of the vector 
field £” therein manifested itself in the fact that we could eliminate it algebraically 
using its EoM. Such a procedure, while physically certainly correct, may however 
become exceedingly cumbersome beyond LO of the derivative expansion. There 
is an alternative that allows one to eliminate €" algebraically before one starts 
constructing the effective Lagrangian [7]. Indeed, note that the covariant derivative 
V,m in (13.61) is linear in Æ" without any derivatives on it. Setting it to zero 
gives € = Wz, which is an algebraic condition that is covariant under all the 
symmetries of the EFT. This is an example of an IHC. Imposing the IHC brings 
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us back to the nonlinear realization of Galilei symmetry solely in terms of the scalar 
x. Yet, invariance under Galilei boosts is still automatically implemented. Galilei- 
invariant Lagrangians can be constructed using the same building blocks (13.61) 
while imposing by hand just invariance under spatial rotations. The latter can be 
ensured using standard tensor methods. 


The auxiliary field £” in nonrelativistic superfluids is associated with Galilei 
boosts. In this concrete case, the corresponding would-be NG modes are 
clearly unphysical. However, it seems to be true generally that there are 
no observed single-particle NG states for which boosts would be the sole 
associated broken symmetry. Curiously, the Goldstone theorem for boosts 
can be saturated by states from a continuum spectrum rather than by stable 
bosonic quasiparticles, as in Fermi liquids [8,9]. A class of mathematically 
consistent EFTs that feature NG bosons originating in boost symmetry has 
been proposed in [10]. 


Eliminating the auxiliary vector field using its EoM or using an IHC must 
lead to EFTs with the same functional form. The only difference is possibly 
in the identification of the effective couplings, but these are usually determined 
subsequently by matching the predictions of the EFT to selected observables. This 
makes the two approaches entirely equivalent. The advantage of using the classical 
EoM (or generally, at loop level, integrating the auxiliary field out) is that it is 
physically transparent. The IHC, on the other hand, is usually technically simpler. 
The drawback is that there is no universal algorithm how to find a suitable IHC. As 
a rule, one has to follow intuition combined with insight in the concrete problem at 
hand. See [11] for a discussion of some of the traps involved. 


Gapped Modes from Secondary Order Parameter Eliminating the redundancy 
of either type, we are back to the standard nonlinear realization of spacetime 
symmetry, developed in Sect. 12.3. Here all the fields, whether NG fields 2“ 
or matter fields x°, describe independent, physical degrees of freedom. Barring 
possible NG modes of spontaneously broken translation symmetry, the NG variables 
are classified as coordinates on the coset space Ho/ H(y,0). However, there may be 
different choices of the order parameter(s) leading to the same symmetry-breaking 
pattern but different coset spaces Ho/H(yo,0), hence different numbers of NG 
variables. This possibility owes its existence to the distinction between the isotropy 
groups Ho, H(y,.,0) and the unbroken subgroup Hy in case of spacetime symmetries. 

A phenomenologically relevant example is the smectic liquid crystal, discussed 
in Sect. 13.3.2. (Another, relativistic example of this type can be found in [12].) Let 
us inductively generalize what we observed therein. Suppose there is a secondary 
order parameter that can be removed from the EFT without affecting the symmetry- 
breaking pattern. Then the associated NG variables are not required to be present by 
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the broken symmetry. By the same token, the corresponding modes in the spectrum 
are not protected by the symmetry from acquiring a gap.’ 

Here one has two choices how to go about the construction of the EFT. If it 
is possible to identify the secondary order parameter beforehand, one can drop it 
from the outset. The resulting minimal nonlinear realization of the given symmetry- 
breaking pattern then only includes genuine NG fields. Accordingly, all modes 
described by the EFT are gapless, that is true NG bosons. Alternatively, one may 
proceed with the nonlinear realization that includes the secondary order parameter. 
The extra would-be NG fields are then dynamical and couple to gapped modes in 
the spectrum. If desired, these fields can be eliminated either by using their EoM or 
by imposing a suitable IHC. 


Example 13.7 


Let us have a closer look at smectic liquid crystals (Sect. 13.3.2). Here the primary 
order parameter is (@(x, t)) = kno - x. This breaks rotations that do not preserve the 
axis of alignment defined by mo. Moreover, it breaks translations along no together 
with the emergent shift symmetry of (x) down to the “diagonal” subgroup. 
Altogether, there are three spontaneously broken symmetry generators. One of these 
corresponds to the genuine NG mode described by the EFT (13.56). In addition, 
there are two gapped modes that can be viewed as fluctuations of the secondary 
order parameter, (n(x)) = no. 

Suppose we start from an EFT including all three degrees of freedom. This might 
even be desirable near the nematic-smectic phase transition at which the modes 
excited by the vector field n(x) become gapless. However, should we decide to 
eliminate the gapped modes, we need a covariant IHC. To that end, note that the 
covariant derivative V,@ in (13.53) splits under the action of the isotropy group 
Hyo,0) ~ SO(2) into a singlet, Vad, and a doublet, Vag. The condition Vad = 0 is 
covariant under all the symmetries of the smectic and moreover is algebraic in 2“. 
In a somewhat more human notation, it is equivalent ton x V@ = 0. The solution 
to this IHC is, up to overall sign, n = V¢/|V¢@|. 

In fact, the IHC can do more than to eliminate unwanted degrees of freedom. 
According to (13.20), covariant derivatives of NG fields 7“ extracted from the w, 
part of the MC form transform exactly as matter fields with respect to some linear 
representation of H(y,,0). The very fact that the constraint Va = 0 can be solved 
for x“ means that the Jacobian matrix 0(Vad)/ dz” is nonsingular, at least around 
the reference point z“ = 0. Instead of setting it to zero, we can therefore treat 
Va = Xa as a new field variable that replaces x“ in the EFT. We conclude that the 
gapped degrees of freedom are indistinguishable from ordinary matter fields. This 


7 A similar phenomenon exists even for internal symmetries. As stressed in Sect. 8.3.2, the primary 
order parameter responsible for type-B NG modes in the spectrum is the commutator matrix (8.26). 
The presence of an additional, secondary order parameter may lead to gapped “partners” of the 
type-B NG modes. See [13] for a detailed discussion within Hamiltonian formalism, or Sect. 8.3.1 
for an outline of how these gapped partner states manifest themselves in the effective Lagrangian. 
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is the underlying reason why they can be removed from the EFT without spoiling 
the nonlinear realization of the broken symmetry. And for the same reason, it is no 
wonder that the broken symmetry does not force them to remain gapless. 


The same reasoning applies to Galilei-invariant superfluids (Sect. 13.3.3). Even if 
one does not wish to eliminate the auxiliary field £" from the EFT, one can still trade 
it for x = & — Vz. This is invariant under Galilei boosts and transforms as a matter 
field in the vector representation of the linearly realized isotropy group SO(d). The 
observation that whenever a set of would-be NG fields can be eliminated from the 
EFT by imposing some IHCs, they can instead be kept and traded for a set of matter 
fields, is general [14]. The change from the would-be NG fields to the matter fields 
is a field redefinition that is nonsingular. This is guaranteed by the fact that the IHCs 
are algebraically solvable. 

The distinction between the three mechanisms discussed so far crucially relies 
on the type of order parameter breaking the symmetry of the system. Within a 
naive EFT based on the agnostic nonlinear realization, there is therefore no way 
to distinguish these scenarios. One has to accept the possibility that some would-be 
NG degrees of freedom of the EFT couple to gapped states. Within the EFT itself, 
it is not possible to tell whether such states are physical or mere artifacts of the EFT 
setup. This suggests that when it comes to spacetime symmetries, the symmetry- 
breaking pattern itself is not restrictive enough. To get more detailed insight, one 
needs additional low-energy data such as the order parameter. See [15,16] for further 
discussion. 


Gapped Modes Required by Order Parameter So far, generators not leading to 
independent NG degrees of freedom could always be identified by inspecting the 
classical symmetry transformations and the order parameter(s). Interestingly, that 
is not the end of the story. There are theories featuring would-be NG fields that 
couple to gapped modes which cannot be discarded by dropping a secondary order 
parameter. See [17] for an example and more details. It is then still possible to 
eliminate such fields using either the EoM or a suitably chosen IHC. One should 
however expect the resulting minimal nonlinear realization of symmetry to involve 
generalized local transformations that depend on derivatives of the NG fields. As 
long as one insists on using solely point symmetries, it is not possible to recover 
the minimal nonlinear realization by reducing the order parameter. In this sense, the 
presence of the gapped modes is required for a description of the symmetry-breaking 
pattern based on a local order parameter. 

In all the scenarios outlined above and summarized in Fig. 13.2, the IHCs play a 
prominent role. It is therefore important to stress that they are just a technical tool. 
We always have the option to work with the full EFT based either on the agnostic 
nonlinear realization or the standard nonlinear realization of Chap. 12. We just have 
to live with the possibility that the EFT may include some gapped modes. These of 
course become irrelevant at sufficiently low energies. The set of genuine NG degrees 
of freedom is fixed unambiguously by the symmetry-breaking pattern. 
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How to choose the IHC in practice is a kind of art in its own right. The examples 
I presented above were all mercifully simple. Nonetheless, there is a useful rule of 
thumb rooted in the expression (13.3) for the MC form. Namely, if the commutator 
[Pu, Qa] contains Qp, then the NG field x| appears linearly and without derivatives 
in the œ component of the MC form. It therefore seems possible to eliminate 1“ by 
setting w? (or the irreducible multiplet of components of the MC form containing 
it) to zero. This naive algorithm however requires further conditions to work [18], 
and moreover depends sensitively on the exact choice of field parameterization [19]. 
It is therefore comforting to know that whether or not one chooses to impose some 
THCs has no effect on the low-energy physics of the theory. 


13.4 Genuine Breaking of Translation Invariance 


Until now I assumed that whatever the pattern of symmetry breaking, the low-energy 
physics is captured by an EFT that lives on the coset space Ho/H(yp,0). AS we 
saw, this is appropriate for systems where the value of the order parameter at all 
spacetime points lies on the same orbit of the symmetry group. Any matter fields 
X£ that might be present can then be discounted. In this last section of the chapter, 
I will briefly outline the changes that are necessary to account for fluctuations 
of generic coordinate-dependent order parameters. The nonlinear realization of 
spacetime symmetry developed in Chap. 12 remains valid even in this general case. 
This is because the construction does not depend on the order parameter as a 
function on spacetime, merely on the choice of the representative point wo € M. 
By the same token, everything said in Sect. 13.1 about the MC form and covariant 
derivatives still applies. 

The new complication is that we now have to explicitly include the matter fields 
x°(x) in the game. These may contain NG degrees of freedom that are not detected 
by the Ho/H(y,0) coset space, namely those of spontaneously broken translations. 
The challenge we face is to find a parameterization of the matter fields that makes 
the existence of the translation NG modes manifest. As of writing this book, there 
does not seem to be a general parameterization that would do the job in presence 
of other, possibly spontaneously broken, symmetries. I will thus merely sketch the 
basic idea in the simplest case of spacetime symmetry breaking by an otherwise 
featureless order parameter. 


13.4.1 One-Dimensional Modulation of the Order Parameter 


To further simplify the problem, consider a single real scalar field #(x) in a theory 
invariant under a purely spacetime symmetry group G. This may be the Aristotelian 
group or one of its extensions including boosts such as the Galilei or Poincaré group. 
The isotropy group Ho contains the transformations that act linearly on spacetime 
coordinates, that is rotations and boosts. For any choice of ¢@9 € R, the isotropy 
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group A(g),0) ~ Ho makes the coset space Ho/A(¢y,0) trivial. There are no NG 
variables and the sole degree of freedom, namely ¢ itself, is of the matter type. 

Suppose now that the VEV (@(x)) varies along one particular direction in space. 
In order to maintain Poincaré (or Galilei) covariance if desired, I will characterize 
this preferred direction by a fixed spacelike vector n”. Our order parameter for 
spacetime symmetry breaking is then defined by a function ọ of a single variable 
such that 


(P(x)) =e: x). (13.72) 


We would eventually like to trade (x) for a NG field m(x) that manifests 
spontaneous breaking of translations in the direction of n”. Before we proceed, 
let me stress that the order parameter (13.72) will as a rule also break rotations 
and (if present) Lorentz or Galilei boosts. However, these never enter our setup 
explicitly. For scalar fields, we already know that both spatial rotations and boosts 
are descendant symmetries in the sense of Chap. 11. Their nonlinear realization will 
therefore be automatically taken care of by the same NG field 7 (x). 

Let us now start with a purely mathematical problem on functions of a single 
variable. I will call the latter z to distinguish it from the spacetime or spatial 
coordinates, x“ and x’. Suppose we are given a function g(z) that “breaks 
translation invariance,” that is, it is strictly monotonic, y’(z) 4 0. We would 
like to parameterize the “fluctuations” of g(z) in a way that makes the breaking 
of translation invariance manifest. We do so by locally shifting the graph of the 
function by m(z) along the z-axis. This defines uniquely a new function @(z) via 
(z) = v(z — x (z)), see Fig. 13.3. Thanks to the assumed strict monotonicity of 
y(z), the mapping between x (z) and ¢ (z) for fixed (z) is one-to-one. 

This little trick can be promoted to an arbitrary number D of spacetime 
dimensions, provided g as assumed only varies in one particular direction. The 
coordinates corresponding to all the other directions in spacetime play a role of fixed 
parameters in the mapping between ¢ and zr. Thus, I assume that y(n - x) in (13.72) 


Fig. 13.3 Two 
parameterizations of the 
fluctuations of a strictly 
monotonic function g(z). The 
first amounts to a local shift 
x (z) of the graph of (z) in 
the horizontal direction. The 
second is given by the 
function ¢ (z) corresponding 
to the shifted graph. 
Mathematically, the two 
parameterizations are related 
by (z) = ykz — z (z)) 
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is a strictly monotonic function of its argument. This is necessary for translations 
along n” to be spontaneously broken at each spacetime point. The scalar field ¢ (x) 
can then be traded for 2 (x), defined implicitly by 


$x) =9(n-x—M(X)). (13.73) 


The function ø as defined by (13.72) obviously depends on the choice of 
coordinates. For instance, (n - x) and g(n - x + c) with any c e R describe 
the same order parameter, differing only by a coordinate shift along n”. This is 
a hallmark of translation symmetry breaking. The introduction of the NG field 7 (x) 
makes it possible to map a fixed function ¢ to a scalar field ø. This requires that 
under a spacetime transformation, be it a translation, rotation or boost, the NG field 
transforms as 


w(x’) = a(x) +n’- x’ —n-x. (13.74) 


In the special case of spatial translations, x —> x + €, this boils down to z’(x + 
€,t) = m(x,t) —n- €, where n is the spatial part of n”. Shifting the NG field by 
a constant is typical for spontaneously broken Abelian symmetries. We are on the 
right track. 

Now that we have settled the parameterization, we would like to construct the 
most general effective action consistent with all the symmetries present. Since there 
were no NG variables to start with, the MC form (13.3) reduces to P - dx. This gives 
a trivial spacetime coframe, e*“ = 54dx’. Also, by (13.17), the covariant derivative 
of the matter field ¢ is trivial, Vig = ôn vo. Therefore, an invariant action can be 
built by taking any Lagrangian density that does not depend explicitly on spacetime 
coordinates. It can contain @(x) with an arbitrary number of derivatives (including 
zero). The indices on the derivatives are to be contracted in a way that preserves the 
linearly realized group Ho. In case of a relativistic, that is Poincaré-invariant theory, 
any Lorentz-invariant Lagrangian density Zeg[@] will do. 

It looks like we have not made much progress. In particular, we have not learned 
anything so far about the possible dependence of the effective Lagrangian on the NG 
field zr (x). However, there are still two requirements we have not addressed yet. The 
first of these is that the order parameter (13.72) is at least a local minimum of the 
energy functional of the EFT. This will constrain what the effective Lagrangian may 
look like. The second requirement is to have a consistent power-counting scheme 
that would allow us to decide which operators in the Lagrangian are most important. 
In order to understand what is at stake, it is best to work out an illustrative example. 


320 13 Broken Spacetime Symmetry in Quantum Matter 


13.4.2 Case Study: Fluctuations of a Domain Wall 


Following Chap. 5 of [20], consider the class of relativistic scalar field theory 
models 


1 
Lot = 5b)" — V9) , (13.75) 
where V (@) is a potential function, required for stability to be bounded from below. 


We can assume without loss of generality that min V(@) = 0. The classical EoM of 
the model reads 


dav) _ 
eo 


o+ (13.76) 


Suppose the potential has (at least) two degenerate minima. Then the EoM has 
domain wall (or kink) solutions g that interpolate between the minima. Such 
solutions depend on a single spatial coordinate, which I will denote as z, and 
converge to the respective minima of the potential for z —> +00. Upon reduction to 
the single variable z, the EoM (13.76) has the first integral (1/2) (d@/ dz)? — V(¢). 
For the domain wall solutions, this first integral evaluates to zero since for z + +00, 
yv(z) converges to a constant such that lim V(g(z)) = 0. This reduces the EoM 
to the following first-order differential equation for (z), 


¢' (z) = £/2V(e(z)) . (13.77) 


The prime on ¢ indicates a derivative with respect to its argument. 


One of the most common scalar potentials is the double-well potential, 
À a2 22 
V@= 5 uy, (13.78) 


where A and v are positive parameters. A simple calculation gives the corresponding 
domain wall solution, interpolating between the minima at @ = +v, 


v(z) = +vtanh[VAv(z — zo)] . (13.79) 
Here zo is an integration constant that determines the center of the domain wall 


where (zo) = 0. The width of the domain wall is given by 1/(/Av). 
Another example of a phenomenologically important potential is 


Vi¢)= „(i — cos £) , (13.80) 
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where m and v are positive parameters. This appears for instance in the low-energy 
effective theory (9.21) of quantum chromodynamics, when restricted to neutral 
pions. In this realization, m corresponds to the pion mass my and v to the pion 
decay constant, fx. The potential V(@) is now periodic with minima at ọ = 2nzrv 
for any n € Z. The domain wall solution, connecting two neighboring minima, is 


(z) = 4v arctan exp[+m(z — zo)] . (13.81) 


In this case, the width of the domain wall is fixed by the mass parameter to be 1/m. 


Both concrete domain wall solutions in Example 13.8 satisfy our criteria for 
the translation-breaking order parameter modulated in one dimension. We would 
eventually like to derive a low-energy EFT for the fluctuations of the domain wall. 
The first step towards this goal is to use the general parameterization (13.73) with 
n-x = z. The potential term in (13.75) can then be eliminated by means of (13.77), 
upon which the Lagrangian becomes 


Lee = [p(z DP | Sua)" a am|. (13.82) 


The contribution of the 1 — 0,7 piece to action is a mere boundary term independent 
of local variations of 7 (x). Namely, f dz[g’(z — 2)? — 4.77) = J dz [o8 
where Z = z—z(x) andzi = lim zZ. Dropping this contribution, we are left with 
the following effective action [ier 


1 
Sert) = 5 f d?x [g'z — rP ur aN . (13.83) 


The EFT (13.83) apparently describes a derivatively coupled NG field x(x) as 
we wanted. However, it is still not quite satisfactory due to the explicit appearance 
of the domain wall background g(z). It is not even clear what the quasiparticle 
spectrum of the EFT is. To that end, we utilize the EoM, linearized in the NG field, 


29" (z) 
gy’ (z) 


(x) — 


0,1 (x) © 0, (13.84) 


where the © symbol indicates the linear approximation. It is convenient to switch to 
the linear fluctuation of the order parameter, x(x) = (x) — g(z).8 The linearized 


8 While z(z) is defined as the horizontal distance of the graphs of #(z) and g(z) in Fig. 13.3 (the 
other spacetime coordinates being fixed), x (z) can be viewed as their vertical distance. 
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EoM (13.84) is thus transformed to 


o" (z) 
o'z) 


x(x) + x(x) 0, X(x) © =p zr). (13.85) 


As a simple check, note that the domain wall solution satisfies g’”/y’ = 
q? V (ọ)/ dy”. Equation (13.85) then descends directly from the Lagrangian (13.75) 
by expanding the latter to second order in x(x). The form of (13.85) allows 
us to Fourier-transform in time and the transverse spatial coordinates (denoted 
collectively as x1). Parameterizing plane-wave solutions by energy FE and 
transverse momentum p1, 


x(x1,z, t) = X(z) exp(-iEt +ip, x1), (13.86) 


the profile function x (z) solves the one-dimensional eigenvalue problem 


B 2 o" (z) . B 2 A . 
| əz + Aios Pi). (13.87) 


In order to appreciate the content of this equation for single-particle states, let us 
return to the two concrete choices of potential introduced in Example 13.8. In both 
cases, I will for simplicity consider only the solution g(z) centered at zo = 0. 


Example 13.9 


For the double-well potential with the corresponding domain wall solution (13.79), 
o" (z)/g'(z) = 2av?[2 — 3 sech?(/Avz)]. Introducing a dimensionless coordinate 
Z = Vivz turns (13.87) into 


of ee ee ae 
( % xing) 1 =( Au? ALK): (13.88) 


The operator on the left-hand side is a special case of the Péschl-Teller Hamiltonian, 
A, = —33 — n(n + 1)/cosh? Z with positive n € Z. The spectrum of this 
Hamiltonian is well-known; see e.g. Chap. 11 of [21]. There are n bound states 
with eigenvalues —n?, —(n — 1)?,...,—1. The continuous part of the spectrum 
covers the open interval (0, oo). In the n = 2 case of interest to us, the ground state 
with eigenvalue —4 gives E = |p 1 |. The corresponding profile function is ¥(Z) « 
1/ cosh? Z. This solution is localized in the z-direction to the vicinity of the domain 
wall. It propagates as a plane wave in the transverse directions with a massless 
relativistic dispersion relation. The upper bound state of H2 with eigenvalue —1 
gives E? = Pi + 3Av?. Its profile function is (Z) œ sinh Z/cosh? Z. This 
solution is also localized to the domain wall and propagates only in the transverse 
directions, but it has a nonzero mass of /3Av. Finally, the continuum part of the 
spectrum of the Péschl—Teller Hamiltonian can be parameterized by the momentum 
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variable p, so that for z —> too, x(z) x exp(ipzz). The corresponding energy is 
EF? = Pi + p? + 4ìv?. This describes delocalized fluctuations of the domain wall 
that propagate in the entire space. They have a relativistic dispersion relation with 
mass 2/Av. 

In case of the cosine potential with its domain wall solution (13.81), 
o" (z)/¢'(z) = m?[1 — 2sech?(mz)]. With the dimensionless coordinate Z = mz, 
the eigenvalue problem (13.87) becomes 


3 2 jan [P-P ; 
(-a ~~ cosh? >) k(Z) = (= = EX) re 


The left-hand side is again one of the Péschl-Teller Hamiltonians, Hj. Its sole 
bound state has the profile function (Z) œ 1/cosh Z. This is a state localized 
to the domain wall that propagates in the transverse directions with the massless 
relativistic dispersion, E = |p|. The continuum part of the spectrum of this 
Péschl-Teller Hamiltonian again amounts to delocalized fluctuations of the domain 
wall. Their dispersion relation is E? = P% + p? + m°. Far away from the domain 
wall, they describe particles of mass m propagating in the bulk. 


We are now ready to draw some general conclusions. Starting with any potential 
V(@) with degenerate minima, the domain wall solution will be localized in 
the z-direction. The linearized EoM (13.84) always admits plane-wave solutions, 
propagating in the transverse directions with the massless dispersion relation, 
E = |p |, and with a constant longitudinal profile, 7(z) « 1. By (13.85), this 
corresponds to (z) « ’(z). The existence of this solution is guaranteed by the 
spontaneously broken translation invariance. Indeed, the fact that 7 (z) is constant 
indicates that this is the NG boson we are after. It does not propagate in the z- 
direction since translations in the z-direction are completely broken, hence there 
is no physical momentum p, to label one-particle states with. The NG mode 
propagates solely along the directions in which translations remain unbroken. 

The second observation is that there is a continuum of plane-wave-like solutions 
that propagate in the bulk. Far away from the domain wall where (z) goes to one of 
the minima of V(@), go, these are simple plane waves, x (z) x exp(ip,z). They have 
a relativistic dispersion relation with squared mass V” (go). In fact, we could have 
guessed this much by expanding the Lagrangian (13.75) around the uniform solution 
(f(x)) = go. In addition to the NG mode and the bulk plane-wave continuum, 
there may be further solutions to (13.85) that are localized on the domain wall 
and propagate only in the transverse directions. These correspond to excited bound 
states of the one-dimensional Hamiltonian on the left-hand side of (13.87). The 
presence of such massive excitations propagating along the domain wall depends on 
the concrete choice of the potential V (¢ġ). 

It is not a priori obvious why the NG mode with z(z) œ 1 is necessarily 
the lowest-lying excitation, that is, why the domain wall solution is stable. To 
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understand this, let us assume without loss of generality that (x) is monotonic 
in z and estimate its energy from below by completing the square, 


J d'r] dz |500? + TVo’ + (0-6) + vo] 
1 1 1 
7 f di-y de {5 (oo? + 5(Vi0) + 5[0 + V2V@)] + o woh 
$+ 
> f EN / dp (206) 


where 6s = lim (x). This estimate, known as the Bogomolny bound, shows that 
. 1 LL % koa . . 

static solutioñs satisfying (13.77) minimize the energy per unit transverse volume 

on the class of field configurations with a fixed boundary condition at z > +00. 


, (13.90) 


We have discovered that our EFT (13.83) inevitably describes both the transla- 
tion NG boson and specific gapped excitations. This is hardly surprising given 
that (13.83) is equivalent via a field redefinition to the original model (13.75). 
One might naively expect that an EFT for the NG mode alone could be 
extracted by integrating over z. After all, we found that the NG solution for 
x (x) does not depend on z at all. However, inserting a z-independent zr (x1, t) 
makes it possible to shift the z-coordinate in g’(z — zr), thus reducing (13.83) 
to 


Serr{t} = . f dz [g' (2)] f d@~'x, dt [(dor)? —(Vi)*]. (13.91) 


This is a noninteracting theory, describing free surface waves on the domain 
wall. We conclude that all interactions among the NG modes in the 
EFT (13.83) are mediated by the gapped modes. 


The problem of disentangling the NG mode from the gapped excitations is related 
to our ability to organize contributions to the EFT by a derivative expansion. Also, 
we have so far been forced to work with specific domain wall solutions (z), 
descending from the chosen model potential V(¢). We would rather have a model- 
independent EFT where the stability of the ground state and the existence of a power 
counting serve merely as constraints. I will now address these issues jointly in a 
framework inspired by the background gauge invariance approach of Sect. 8.2. 
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13.4.3 Effective Action from Background Gauge Invariance 


The idea to use background gauging to derive an EFT for spontaneously broken 
translations has been utilized in multiple contexts, including inflationary cosmol- 
ogy [22] and superconductivity [23]. Here I will loosely follow the exposition 
in [16]. We are looking for an effective action Seg{a} for the NG mode of 
spontaneously broken translations in the z-direction. In the spirit of Sect. 8.2, we 
first promote the physical symmetry to a local invariance by adding a set of 
background gauge fields. Localizing spacetime translations leads to the group of 
spacetime diffeomorphisms. The precise choice of the corresponding background 
field is constrained by the physical symmetry and field content. In case of relativistic 
theories of scalar fields, one can use a spacetime metric, g,,, (x), to make the gauged 
action Ser {2, g} diffeomorphism-invariant. 

The transformation of the metric under a diffeomorphism of the spacetime 
manifold, x —> x'#(x) = x + e#(x), is given by the push-forward by the 
inverse of the diffeomorphism; see Appendix A.3.1. This must be augmented with 
a transformation rule for the NG field z(x). In order to maintain the relation 
d(x) = (z — m(x)) between a scalar field @(x) and a fixed function (z), (x) 
should transform as 


mx! (x!) = w(x) + F(x). (13.92) 


The NG field can now be eliminated by choosing the diffeomorphism as «“(x) = 
—6!z(x). With a slight abuse of notation, I will denote this operation as J_z. This 
reduces the action to Seg{z, g} = Serr{0, Fxg}. The dependence of the action 
on the composite gauge field (7_7g),» is constrained by invariance under the 
residual group of diffeomorphisms that fix the point m = 0. These are (possibly 
z-dependent) (D — 1)-dimensional diffeomorphisms with €*(x) = 0. Once the most 
general admissible action Seff{0, 7_z g} has been found, the EFT in flat Minkowski 
spacetime can be recovered by setting g,,,(x) to the Minkowski metric. 

To constrain the form of Serr{0,7_7g}, it is convenient to separate the z- 
coordinate from the other spacetime coordinates. I will indicate those using 
lowercase Fraktur indices m, n,.... It will also be more practical to work with the 
inverse of the metric, g”” (x), rather than g,,,(x) itself. The transformation of the 
metric under diffeomorphisms then splits as 


gi = g (87 + dae )(5p + pe") + g% (87 + ðae™)ðze" 
+ gd," (5, + Ope") +g 0E" E", 

gM = gM ST + due )OpE* +.B (Sy + Da ™ A + 82E) + geze" e? 
+ga" + d:€%) , (13.93) 


g' = gM Agee? + g Ine (1 + 82E) + gA + dze) + gE + d,€%). 
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For the sake of brevity, I suppressed the arguments x of g(x) and x’ of g/#"(x’). 
The transformation rule for g*™ is analogous to that for g™. 

Under the restricted (D — 1)-dimensional diffeomorphisms with €*(x) = 0, 
g7 (x) behaves as a scalar, g’**(x’) = g7 (x). On the other hand, g" = g% (80 + 
da€™) depends on both g" and g%. Finally, g/"" = g% (6m + da€™) (êg + dge") 
depends on all g™", g", g*" and g**. This suggests that our main ingredient for 
constructing diffeomorphism-invariant actions will be 


Trg)” = g™ dar dyn — 2g% 3an (1 — 8.x) + g7 — a,x)" 
(13.94) 
= g” — 29°" Oy 7 + g ay px ; 


where the argument of (Trg) is X“ = x” — ô! n(x). The z-component of x“, 
that is Z = z — 7 (x), is itself invariant under spacetime diffeomorphisms and can 
thus appear in the effective action without restrictions. Finally, the effective action 
may also contain (J_,g)"* and (7_,g)"". These, not being scalars, can however 
only enter through higher-derivative tensors such as the Riemann curvature tensor. 
See Appendix A of [22] for a discussion of geometric structures allowed by the 
restricted (D — 1)-dimensional diffeomorphism invariance. For our purposes, the 
main conclusion is that the part of the effective action dominant at low energies will 
be included in 


Sesel, 8) = / dE vol(T =n 8)%) FE TDO) 
(13.95) 
= | dx vol(g)(x) f(z — (x), (Tn) (%)) . 


Here f is a generic smooth function of two variables. The volume measure 


vol(g) for a metric of (timelike) Lorentzian signature equals ./(—1)¢ det g; 
cf. Appendix A.7.2. 


Using the specific choice f(u, v) = (1/D[¢'(u)]? (v — 1) and subsequently going 
back to the flat Minkowski spacetime gives 


Setir} = : fers [o'(z — 2) IOT)? — 201 — a,x) . (13.96) 


This reproduces our previous model result (13.82). 
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A systematic expansion in derivatives of the NG field is easier if one uses the 
combination (7_7g)** + 1, which goes to 20,7 + (0,7)? in the flat-spacetime 
limit. This suggests expanding the function f(u, v) in (13.95) in powers of v + 1. 
Up to second order in v + 1 and thus in derivatives, we have f(u,v) = co(u) + 
cı (u) (v + 1) + c2(u)(v + 1)? + - --, which translates to the Minkowski-spacetime 
action 


Seri} = J dx {co(z — m) + ci (z — 1) [20,0 + (8y70)"] 
(13.97) 


+ c2 (z — m )[29; m + (ður) } aeri 


This is the low-energy EFT we have been looking for. It remains to elucidate the 
consistency constraints on the functions c;(z — x). To that end, we expand the 
Lagrangian in (13.97) to second order in the NG field, 


Zel] = coz) + [-cp(z) + 2c1(2)3z]x (13.98) 
1 1" 1 2 2 2 
+ zlo — 2c) (z)dz]a~ + ci (z2) (Oum) + 4c2(z) (320) +. 
Bulk stability requires absence of any terms linear in x. This leads to the constraint 
coz) + 2c} (z) =0. (13.99) 


In fact, one can demand even co(z) + 2c1(z) = 0, since any constant offset of co(z) 
can be dropped from (13.97). This sharper relation is indeed satisfied by the choice 
of f (u, v) in Example 13.10, where co(u) = —[g'(u)]? and cı (u) = (1/2)[g’ (Ww). 
With the constraint (13.99), the first term on the second line of (13.98) automatically 
drops, and the Lagrangian boils down to c1 (z) (ður)? +4c2(z)(0-7)* up to a surface 
term. A necessary (though not inevitably sufficient) condition for the corresponding 
energy functional to be bounded from below is therefore 


cı(z)> 0 and cı(z) > 4c2(z) foranyzeR. (13.100) 


So far, I have not assumed any specific choice or properties of the profile 
functions c; (z). The effective action (13.97) defines a generic EFT for the NG mode 
of spontaneously broken translations in the z-direction in a relativistic system. In 
particular, the functions c; (z) in (13.97) do not have to arise from a topologically 
nontrivial background such as a domain wall. However, if that happens to be the 
case, we expect them to be localized to a finite range in z. The NG mode will then 
correspond to a surface wave and propagate solely in the transverse directions. I 
would now like to use the general EFT (13.95) to shed some light on the physics 
of these surface waves beyond the noninteracting approximation (13.91). To that 
end, it is convenient to switch to the x” coordinates. Namely, a(x) measures 
the local fluctuation-induced displacement of the domain wall in the z-direction. 
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Geometrically, the dynamical state of the domain wall can thus be viewed as a 
hypersurface in R? defined by a constant value of 2. 

With the shorthand notation g = 7_,g, the determinant of this D-dimensional 
metric can be decomposed as det g = (8%%)~! det g. Here Gm is a projection of 
Suv to the (D — 1)-dimensional space of x", which is nothing but the metric on the 
domain wall induced by the bulk Lorentzian metric in R?. The factorization of det g 
follows from the mathematical properties of block matrices [24]. Augmenting this 
observation with the constraint co(z) + 2c1(z) = 0 turns the first line of (13.95) to 


TRE f dE vol(§)(%) 


x {oe + Es F a | ("Gy if es | 


(13.101) 


So far this is an exact rewriting of (13.95) except for the truncation of the power 
expansion in g** + 1. Now we take the limit of flat spacetime. Simultaneously, we 
restrict to fields independent of the z coordinate, as appropriate for the NG mode. 
This makes it possible to do the integral over Z independently of the NG field. We 
thus arrive at an effective (D — 1)-dimensional theory for the surface waves on the 
domain wall which, up to higher-order corrections, takes the form 


Sue = f aco f atte dt vol) (xL, +. (13.102) 


In this approximation, the EFT for the domain wall fluctuations is completely 
geometric and fixed by the induced metric, 


Omn = Sin — MmT WT , vol(g) = V1= gn OT . (13.103) 


The expression for vol(g) follows from the so-called Weinstein—Aronszajn identity 
for matrix determinants. The reader may recognize this as the Dirac—Born—Infeld 
(DBI) theory that we met previously in Sect. 10.3.1. All that is left of the profile of 
the domain wall is the overall prefactor in (13.102). 


The prefactor f dzco(z) measures, up to overall sign, the energy of the domain 
wall per unit transverse volume, that is its surface tension. For the class of 
models (13.75), it equals — f dz [g’ (z)]*; see also (13.90). In case of the double-well 
potential with the domain wall solution (13.79), this gives —(4/ 3)VAv?. For the 
cosine potential with the corresponding domain wall (13.81), one finds analogously 
—8mv?. 
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The (D — 1)-dimensional theory (13.102) obviously describes only the NG 
mode as we wanted; the gapped modes we found in Sect. 13.4.2 are gone. The 
price to pay is that the D-dimensional Lorentz symmetry is now realized in a 
nontrivial fashion beyond our spacetime symmetry paradigm; cf. (10.50). 

The fact that all the modes propagating in the D-dimensional bulk are 
gapped is essential for the validity of the EFT (13.102). There are physical 
systems where the bulk modes are naturally gapless, for instance when two 
immiscible superfluids are separated by an interface. In this case, integrating 
the bulk modes out makes the EFT for the surface waves on the interface 
nonlocal. This leads to a fractional-power dispersion relation of the surface 
modes, E(p) « | pl?’ 2 See [25] for a detailed discussion from an EFT 
perspective. 


13.4.4 Further Possible Applications 


In discussing spontaneous breaking of spatial translations, I have deliberately 
merely outlined the general approach and then focused on a specific type of system 
to illustrate it. There are however several natural, physically motivated modifications 
or generalizations of the setup. I will now at least briefly mention some of the 
possible avenues one might wish to follow. 


Periodic Modulation of Order Parameter Mathematically the simplest modifi- 
cation is one with a single, otherwise featureless real order parameter where the 
profile y(z), or co(z), is not spatially localized. The most interesting situation arises 
when ¢(z) is a periodic function of z. In this case the physics changes qualitatively. 
The eigenvalue problem (13.87) for linear fluctuations will have no bound states. 
Instead, its continuous spectrum will have a band structure. There should still be a 
NG mode of the spontaneously broken translations, but this will assume the form 
of a sound wave (phonon) on the crystalline background (z). In the limit where 
y(z) consists of widely separated localized kinks, the propagation of sound in the 
z-direction arises from tunneling between states localized on the individual kinks. 

The detailed structure of the dispersion relation of the low-lying phonon excita- 
tions will be no less interesting. Namely, it turns out that the spontaneously broken 
symmetry under spatial rotations forbids the cı term in (13.97). As a consequence, 
the derivative expansion of the effective Lagrangian starts at second order in 
longitudinal derivatives 0, but fourth order in the transverse gradient V, [16]. We 
already saw the same behavior in smectic liquid crystals (Sect. 13.3.2), and I will 
return to its physical implications in Chap. 15. 


Other Spacetime Symmetries Another natural possibility is to consider systems 
with a real order parameter modulated in one spatial dimension, but with a 
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different spacetime symmetry than Poincaré. This is relevant in particular for any 
condensed-matter system. As long as one has a concrete microscopic model, the 
analysis proceeds along the same steps as in Sect. 13.4.2. One can however also 
adopt the model-independent approach of Sect. 13.4.3 based on background gauge 
invariance. This requires coupling the theory to an appropriate spacetime geometry. 
A basic discussion can be found in Appendix A of [16]. For a more complete, 
if mathematically also more advanced, overview of non-Lorentzian geometry, 
see [26]. 


Additional Degrees of Freedom One might also want to study spontaneous break- 
ing of translation invariance alongside other, possibly internal, broken symmetries. 
The field parameterization required then follows the general standard nonlinear 
realization of Sect. 12.3 combined with the identification of the translation NG 
field à la Sect. 13.4.1. The background gauge invariance approach may again prove 
helpful. Its application however relies on our ability to simultaneously gauge all 
the relevant symmetries by adding suitable background gauge fields. This is not 
guaranteed a priori without further qualifications. Finally, there are also physical 
systems where spacetime translations are spontaneously broken in more than one 
direction. In this case, the idea behind Sect. 13.4.1 might still be possible to apply 
provided one has at hand a sufficient number of fields to parameterize the order 
fluctuations uniquely. 
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The first example of a Nambu—Goldstone (NG) boson I mentioned in the intro- 
duction (Chap. 1) was hydrodynamic sound. However, our subsequent exploration 
of spontaneous symmetry breaking (SSB) took us very far away from this initial 
target. Indeed, I focused almost exclusively on the quantum world. This had a good 
reason: many conceptual subtleties of SSB are rooted in the description of quantum 
symmetries in terms of operators on the Hilbert space of states. We then did much 
work to establish the methods of nonlinear realization of symmetry and effective 
field theory (EFT) for NG bosons. Eventually, we discovered that the leading order 
(LO) of the derivative expansion of the EFT for quantum systems with SSB is a 
classical field theory. In this chapter, we shall close the circle and see how the 
techniques we have developed can be applied to purely classical systems. 

To build the necessary intuition, I will start in Sect. 14.1 with a primitive toy 
model for elastic solids. While physically inadequate in the details, this is good 
enough to show that a classical medium may possess an emergent symmetry, 
reflecting its internal structure. A proper identification of such emergent symmetries 
is key for distinguishing thermodynamic phases of matter with otherwise identical 
microscopic dynamics, such as solids and fluids. The subsequent analysis is straight- 
forward and follows the standard workflow from nonlinear realization to effective 
actions. The nonlinear realization of a purely spacetime symmetry augmented with 
the emergent symmetry of classical matter is detailed in Sect. 14.2. In the following 
Sect. 14.3, I then work out explicit examples of EFTs for different phases of matter. 


14.1 Emergent Symmetry of Classical Matter 


Classical matter is characterized by the possibility to uniquely label its elements and 
individually track their evolution. While doing so, one has to carefully distinguish 
two different types of coordinates. The first type are the genuine spacetime 
coordinates, independent of whatever matter is present in the spacetime. The 
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Fig. 14.1 One-dimensional chain of point masses, connected by linear springs. The springs are 
only allowed to vibrate in the horizontal direction. The masses and spring constants may vary along 
the chain as indicated 


second type are the coordinates that label the individual elements of the medium. 
These are known in classical mechanics as body coordinates (see e.g. Sect. 8.1 
of [1]) or material coordinates (Chap. 4 of [2]). They capture internal variations 
of the material properties of the medium. The difference between the two types of 
coordinates is best elucidated by an example. 


14.1.1 Introduction: Spring Model of Elasticity 


One of the most basic mechanical models, illustrating a transition between mechan- 
ics of particles and continuous field theory, is the linear spring chain, shown in 
Fig. 14.1. For the sake of discussion, I will assume that the particles connected 
by the springs have generally unequal masses. Likewise, the springs themselves 
may have different spring constants, although I will for simplicity assume equal rest 
(unloaded) length a. Denoting the position of the i-th particle as x;, the Lagrangian 
of the system is 


L= 2 mix; sh (xi+1 — xi — a) E (14.1) 


The ground state of this system corresponds to a chain of point masses at rest, 
placed equidistantly at distance a between the nearest neighbors. One can choose 
the origin of coordinates so that in the ground state, (x;) = ia. We would now like 
to perform the continuum limit, assuming that a is very small. This requires that 
the displacement x; — ia from the equilibrium position only varies appreciably over 
distances much longer than a. I will replace the discrete index i with a continuous 
body coordinate X through i — X/a. The individual masses m; are replaced 
with the linear mass density ọ(X) via m; —> ago(X), and the spring constants 
ki with the local elastic (Young) modulus E(X) via kj — E(X)/a. Treating the 
Lagrangian (14.1) as a Riemann sum then leads to the continuous approximation 


= fax [e0 Hrs SECO xxx, i= ir| . (14.2) 


This is a field theory in one spatial dimension with the dynamical degree of 
freedom x(X, t). Provided both @e(X) and E(X) are positive for any X e€ R, 


14.1 Emergent Symmetry of Classical Matter 335 


which follows from their origin in the parameters m; and k;, the ground state 
is (x(X,t)) = X up to an additive constant. This residual freedom stems from 
spontaneous breakdown of spatial translations. Parameterizing the displacement of 
the medium from equilibrium by the field ¢ (X, t) = x(X, t) — X, the fluctuations 
around the ground state are governed by the equation of motion (EoM), 


Q(X) dgb(X, t) = dx[E(X)Ox G(X, 1)] . (14.3) 


In the limit of equal masses m; and spring constants k;, the functions ọ(X) 
and E(X) become constant. The EoM (14.3) then describes compression waves, 
propagating along the chain with phase velocity v = ./E/o. In this limit, the action 
for ¢ (X, t) features a set of emergent continuous symmetries that were not present 
in the original discrete model (14.1). The most obvious of these is the invariance 
under the “translation” X > X +e. A straightforward application of Noether’s 
theorem shows that the corresponding integral charge is, up to overall normalization, 


Pate f dX 004 (X, axO(X, 1) . (14.4) 


Despite the suggestive analogy with translation invariance, this is not the momentum 
carried by the oscillating masses of the original discrete chain. The latter rather 
equals 


P= X miki = fax o(X)de (X, t) . (14.5) 


The resolution of this puzzle is that in presence of a uniform classical medium, 
we have two different coordinates and consequently two different translation sym- 
metries. The position of the mass m; with respect to the laboratory inertial reference 
frame is given by the coordinate x;. Accordingly, a genuine spatial translation 
amounts to the shift x Ó x +. In the formulation (14.2) of our field theory, 
this acts like an “internal symmetry.” That is, it transforms the dependent variable 
x(X,t) while leaving the independent variable X intact. Applying Noether’s 
theorem recovers the integral momentum (14.5). This remains conserved for any 
choice of functions ọ(X) and E(X). That is because momentum conservation 
reflects the uniformity of the underlying space itself; it does not care about the 
properties of the medium. The corresponding local conservation law is equivalent 
to (14.3), which is the continuous limit of the Newtonian EoM for the point masses 
mi. 

The other integral charge, (14.4), is usually called pseudomomentum. This 
arises, as already stressed, from invariance under X = x + €. Given that X was 
introduced as a material coordinate, conservation of Ppseudo should reflect the 
uniformity of the medium. Sure enough, it was essential for derivation of (14.4) 
to assume that ọ(X) and E(X) are constant. The distinction between momentum 
and pseudomomentum has historically led to much confusion in the research on the 
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continuum mechanics of fluids. See [3] for a pedagogical account of some of the 
related subtleties. 


The comparison of momentum and pseudomomentum illustrates best the 
striking contrast between the fundamental spacetime symmetries and the 
emergent symmetries of matter. However, the field theory (14.2) also features 
other emergent symmetries than translations. Namely, for constant 0(X) 
and E(X) its action is invariant under the whole Poincaré group ISO(1, 1) 
of transformations acting on X,t. These include the “translations” of X, 
giving rise to conservation of pseudomomentum, and the usual time trans- 
lations, leading to conservation of energy. In addition, the theory has an 
emergent SO(1, 1) Lorentz invariance under “boosts” that mix X and t. 
It is straightforward to work out the corresponding conservation law but 
I will not do so since we will not need it. Finally, the dynamics of our 
nonrelativistic spring chain should be invariant under Galilei boosts. Under 
the corresponding transformations, x + x + vt with v being the velocity of 
the boost, the Lagrangian (14.2) shifts by a total time derivative. Similarly to 
spatial translations, this is a genuine spacetime symmetry that is present for 
any choice of @(X) and E(X). 


Before closing the discussion of our toy model, let us briefly look at its 
generalization to a higher number of spatial dimensions d. For illustration, it is 
sufficient to take d = 2 and consider a rectangular network of springs as in Fig. 14.2. 
The position of the particle m; į is now given by a two-component vector x; j. 
Assuming otherwise the same dynamics as before, whereby the potential energy of 


Fig. 14.2 Rectangular 
network of springs as a toy 
model of a two-dimensional 
elastic solid. All the point 
masses mj, ; are allowed to 
differ, as are the spring 
constants kl . and k? .. The 
rest (unloaded) length of all 
the springs is assumed to be 
the same and equal to a 
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each spring is a quadratic function of its extension, the Lagrangian becomes 


1 
L= 3 E miji; | [xi+1,7 — Xi,j| — a)” 


(14.6) 


1 2 
= zki (ei ja — xi, jl — a) | : 


The ground state of this model corresponds to a rectangular network of equidistantly 
placed particles at rest. One can choose Cartesian coordinates in the plane so that 
(xij) = (ia, ja). To perform the continuum limit, we make the replacement 
(i, j) > X/a and mi j > a’o(X), where X is a two-component vector of body 
coordinates and ọ(X) the local mass density. We also need two elastic moduli, 
kl, — E(X) and ke, — E(X). In terms of the displacement vector ¢’ (X, t) = 

x! (X, t) — X",! the Lagrangian of the resulting continuous two-dimensional theory 
reads 


2 1 2 _t 1 2 
L > | dX z2 lA X, H] z E19 (X, t)] 
(14.7) 


-le 2X, D) 
372 np (X, t) ¢ + 


The ellipsis stands for terms of higher order in ”; in d > 2 dimensions, the 
dynamics of the spring network does not resolve into a superposition of one- 
dimensional harmonic motions. Moreover, the model (14.7) does not correctly 
capture the physics of elastic solids even in the harmonic approximation. Namely, 
for purely transverse oscillations such that 0” /d.X” = 0 for each fixed r, there is no 
linear restoring force and the motion is strongly anharmonic. In spite of these flaws, 
the model (14.7) is sufficient to shed light on the fate of the emergent symmetries. 
We always find exact invariance under genuine spacetime translations, spatial 
rotations and Galilei boosts regardless of the choice of the functions ọ(X¥) and 
E\,2(X). This much is obvious from the discrete version (14.6) of the model. 
These symmetries reflect the properties of spacetime itself and are insensitive to 
the material structure of the medium. Should the medium be uniform, with constant 
o(X) and E1,2(X), we will in addition have invariance under continuous “internal 
translations” of X. This symmetry ensures conservation of pseudomomentum, and 
we can expect it to arise in the long-distance description of real crystalline materials. 
However, we cannot in general expect invariance under continuous “internal 


'T previously introduced the symbol X“[w, x] to indicate the shift of the spacetime coordinate x” 
under an infinitesimal symmetry transformation. In this chapter, I will not use this notation in order 
to avoid confusion with the conventional symbol X” for body coordinates. The same applies to the 
symbol F'[y, x], previously introduced to denote an infinitesimal transformation of a field y’. In 
this chapter, I will reserve the letter F for the function defining the LO effective Lagrangian. 
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rotations” of X, unless we demand E; = E> and restrict the Lagrangian (14.7) to 
the lowest order in the power expansion in ¢’. This suggests that the macroscopic 
properties of real crystals tend to be homogeneous but anisotropic even in the 
continuum limit a — 0, which agrees with empirical observations. The dependence 
of the Lagrangian on X” is then constrained by the continuous internal translation 
invariance and the discrete group of point symmetries of the crystal lattice. 


14.1.2 Emergent Symmetries of Solids and Fluids 


We shall now synthesize the above observations into a general setup for describing 
classical matter. Much of the discussion in this subsection is inspired by Chap. 4 
of [2]. The basic assumption is that we are dealing with a thermodynamic state 
where quantum correlations are limited to short distances, typically due to thermal 
fluctuations. It is then possible to identify elements of the medium that are mutually 
distinguishable and can be assigned unique labels. This requirement limits the 
validity of the EFTs developed below to distances much longer than the scales 
characterizing quantum correlations and the discrete structure of matter. 

The labels X’ on the medium elements take values from a target space M, which 
is typically some domain in the real space, R?. The domain would be finite for an 
isolated material object of a finite size. However, here we will mostly be concerned 
with properties of matter filling the entire space, where it is natural to take M ~ R. 
In both cases, the labels X’ represent comoving body coordinates, attached to a fixed 
medium element. The trajectory of the element is defined by giving its position as 
a function of time, x(X, t). This is the standard Lagrangian picture of continuum 
mechanics, which I used to formulate the toy models (14.2) and (14.7). In order 
to highlight the genuine spacetime symmetries, it is however more convenient to 
invert the relation between x” and X’. The time evolution of the medium is then 
specified by a map from the spacetime M to M that assigns to every point x“ € M 
the material coordinates X’ (x, t) = Xİ (x) of the element occupying this point. The 
uniqueness of the labeling of the medium elements is ensured by requiring that for 
any fixed time t, the map x” —> X! (x,t) is a diffeomorphism between space and 
M. The advantage of this picture is that the labels X’ take the same values in any 
reference frame. In other words, the functions X! (x) are scalar fields with respect to 
any spacetime symmetry. On the other hand, emergent symmetries acting solely on 
Xİ can then be treated as internal symmetries, that is point transformations on M. 


The explicit realization of the fields X i (x) depends on a choice of coordinates. 
Importantly, the coordinates x” in the spacetime and the material coordinates 
Xİ can be fixed independently of each other. See Fig. 14.3 for a visualization 
of this freedom. In the following, I will deal exclusively with matter in flat 


(continued) 
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Fig. 14.3 The time evolution 
of the classical medium as a 
map fromthe spacetime Mto 8 fer Nee 
the target space M of body 
coordinates. The spacetime 
coordinates x“ in M and the 
body coordinates XÍ in M 
can be chosen arbitrarily and 
independently of each other 


R4 
— ýy ç >“ 
X'(x) 


spacetimes. Accordingly, I will use standard Minkowski coordinates x” for 
relativistic systems, and Cartesian coordinates x” augmented with Newtonian 
time ¢ for nonrelativistic systems. This still leaves us with the freedom to 
choose the body coordinates X i at will, which should be distinguished from 
any internal symmetry acting on M. In Sect. 14.1.1, I fixed this freedom 
tacitly by the definition of X” in terms of the discrete labels i, j. In order 
to avoid misunderstanding and to highlight the scalar nature of the body 
coordinates, I have now changed the notation from X” to X?. 


The scalar fields Xİ (x, t) are the generalized coordinates of our continuum field- 
theoretic description of classical matter. The corresponding generalized velocity is 
9X! (x, t). How is this related to the actual local (Eulerian) velocity of the medium 
with respect to the laboratory frame? Recall that Xİ are comoving labels that cannot 
change along the trajectory of the medium element. Parameterizing the trajectory 
as x(t), the condition dX’ (x(t), t)/dt = 0 leads to dg9X' = —x"d,X', where x = 
dx /dt is the desired local velocity. The matrix M p (x, t) = 3- Xİ (x, t) is invertible 
thanks to the assumption that the map x” —> X! (x, t) is a diffeomorphism. It follows 
that 


x” = —(M7!)" 3X . (14.8) 
There is also another, more elegant albeit slightly more mathematically advanced, 


way to express the local medium velocity in terms of derivatives of X! (x). First pick 
any function f(X) on M and define the current 


“Md esi, Oy, X! +++ By, X"4 | (14.9) 
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By the myama of the Levi-Civita (LC) spuibar Jy A ðu X' = i — 0 for any i. Expan- 
ding this as JpaoX' + J; M', = 0 gives Jy = -JẸ (M- Dr, do. X!. Using (14.8), we 
can thus rewrite the cunni as 


= (JẸ, JP). (14.10) 


It also follows from the antisymmetry of the LC symbol that the current (14.9) 
is conserved, 0,, J = 0, for any choice of the function f(X). It is therefore natural 
to identify J t = f(X) det M with the density of a conserved charge as observed in 
the faborito frame. The spatial part of the current, J} = =J; °%", then represents 
the flow of this charge. Finally, f(X) itself is the density of the same charge 
in the comoving material coordinates, since det M is just the Jacobian of the 
transformation between the spacetime and material coordinate systems. 


The origin of the currents J% can be traced back to the fact that the individual 
elements of our classical medium are distinguishable. Indeed, we could in 
principle attach to each element a dedicated “charge.” All such charges would 
be conserved by construction. The freedom to choose the function f(X) 
in (14.9) at will is just a continuous version of this observation. Operationally, 
the existence of infinitely many conservation laws, one for each f (X), is not a 
problem. Namely, J% are not Noether currents in that their conservation does 
not require the EoM. The presence of such identically conserved currents does 
not constrain the local classical dynamics of the medium. 

The above said does not mean that the currents J/* are inevitably artifacts 
of the Lagrangian picture of continuum mechanics that are not macroscopi- 
cally observable. Suppose our medium carries a conserved charge. This could 
count for instance the number of particles or, in the nonrelativistic limit, 
their mass. By choosing f(X) as the density of the charge in the material 
coordinates, we get a current J i that describes macroscopic flow of this 
charge. 


Mathematically, the currents (14.9) descend from the d-forms f (X)dX DASA 
dX? on M. Being top-dimensional, any such d-form is automatically closed. The 
closedness is preserved when the d-form is pulled back to the spacetime by the map 
x” —» X! (x). Taking the Hodge dual of the ensuing closed spacetime d-form then 
gives the current J% with vanishing divergence. 

Having established the basic setup, we are now finally in a position to discuss 
the emergent symmetries of various types of classical matter. I will focus on the 
two most common classical phases of matter: crystalline solids and fluids. Since 
this will to a certain extent merely collect and organize some observations I have 
already made previously, I can afford to be brief. 
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Solids We will be interested in the quasiequilibrium dynamics of crystalline 
solids at distances much longer than the lattice spacing of the microscopic crystal 
structure. In this regime, one can expect the solid to be internally uniform, and 
thus be symmetric under the group R? of continuous internal translations. These 
translations take a particularly simple form in “Cartesian coordinates” on M, 
namely X! +> xi +e. Barring possible quasi-invariant contributions, the effective 
Lagrangian can then only depend on Xİ (x) through their derivatives. In addition, 
the dependence on X‘ (x) is constrained by the point symmetry of the crystal lattice. 
This can be imposed order by order in powers of X’ (x) using tensor methods. I 
will not go into detail here, and rather refer the reader to Sect. 10 of [4] for further 
discussion. For the sake of illustration, I will only deal with the special case of 
isotropic solids, which feature a full SO(d) symmetry under continuous rotations 
of XÏ (x). Exact, full isotropy cannot really be achieved for any crystalline material. 
However, some polycrystalline materials are effectively isotropic at distances much 
longer than the typical size of a grain, thanks to the random orientation of the grains. 


Fluids The key difference between solids and fluids is that in the latter, there are no 
restoring elastic forces that would counteract shear strain. This enables macroscopic 
flow, whose physics is largely affected by dissipation. Unfortunately, dissipative 
effects are notoriously difficult to include in Lagrangian field theory. In order that the 
EFT for fluids we construct be meaningful, we thus need to make some simplifying 
assumptions. Namely, we shall restrict ourselves to physical processes that do not 
involve entropy production. Also, we will only consider states of the fluid that are 
perturbations of a uniform equilibrium where the entropy density in the material 
coordinates is constant. Such fluids are called barotropic; their thermodynamic state 
can be described by a single variable such as pressure or density. Thus, our EFT 
will be able to capture processes such as sound propagation, but not, for instance, 
convective heat transfer due to an initial temperature gradient. 

By definition, the local density of energy and entropy, and thus also of any other 
extensive property, of a barotropic fluid corresponds to a constant function f on 
M. At the same time, the macrostate of the fluid is completely specified by its 
local Eulerian velocity and density J? observed in the laboratory frame. The same 
should be true for the Lagrangian, which is directly connected to observables such 
as energy density and pressure. The field-theoretic description of fluids in terms of 
the scalar fields Xİ (x) should therefore be invariant under any point transformation 
on M that preserves the currents (14.9). This is not mere freedom of choice of 
coordinates on M. It corresponds to an actual physical reshuffling of the elements 
of the fluid that does not affect its macroscopic properties. Now J o = f det M 
requires that the point transformation on M preserves det M, and so must have a 
unit Jacobian. We conclude that the dynamics of a barotropic fluid possesses an 
emergent symmetry under the group of volume-preserving diffeomorphisms (VPDs) 
on M, SDiff(M). This is an infinite-dimensional symmetry group that includes the 
internal translations and rotations of solids as a subgroup. See [5] for a detailed 
account of the role of the diffeomorphism group in hydrodynamics, and [6,7] for 
two somewhat complementary overviews of the variational approach to fluids. 
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14.2 Nonlinear Realization of Emergent Symmetry 


We have done a serious amount of work to carefully identify the emergent 
symmetries of classical matter. Our effort will now pay dividends in that the 
next steps will be fairly straightforward. In the absence of degrees of freedom 
sensitive to other symmetries, the symmetry group of a classical medium is G ~ 
Gst. X Gint. Here Gst. is whatever spacetime symmetry is appropriate for the 
system at hand, such as Aristotelian, Galilei or Poincaré. The group Gint collects 
all the emergent symmetries that act on the scalar fields X‘(x) as internal. For the 
reader’s convenience, the choices of Gint in case of uniform solids and barotropic 
fluids are summarized in the first two columns of Table 14.1. The product structure, 
G = Gg. X Gint, of the symmetry group already appeared in Sect. 13.2, and we will 
be able to largely follow the path paved therein. 


14.2.1 Field Variables and Unbroken Symmetry 


To start with, the isotropy group of the spacetime origin is Ho ~ Hs +. X Gint, where 
H, +. collects all transformations from Gs.. that fix the origin (typically rotations 
and boosts). I will always assume that Gint includes a set of mutually commuting 
translations that act transitively on M ~ Rf. The isotropy subgroup A(Xo,0) is then 
the same, up to conjugation by an element of Gint, for any choice of the reference 
point Xe: It is convenient to set Xi = 0 so that H(x,0) ~ Ast. x Hint where Hint 
consists of all emergent symmetries that fix the origin in M. The concrete isotropy 
subgroups for solids and fluids are listed in the third column of Table 14.1. Note 
that in case of fluids, both Gint ~ SDiff(R%) and Hin ~ SDiffo(R“) are infinite- 
dimensional. However, the coset space Gint/ Hint 1s equivalent to R? for any choice 
of Gint thanks to the assumed transitive action of Gint on M. This reconfirms that 
the dynamical degrees of freedom of the EFT for classical matter will always be 
the d NG fields X’ (x). That is in contrast to the agnostic nonlinear realization 


Table 14.1 Overview of the emergent symmetries and related subgroups in different phases of 
classical matter. The Geryst symbol denotes the crystallographic point group of a crystal lattice. 
Also, SDiffo (R) is the group of VPDs in R? that fix the origin. The coset space Gint/ Hint is 
equivalent to R? in all cases. The unbroken subgroup Hg indicates the symmetry of a uniform, 
static equilibrium state. The subscript “diag” denotes diagonal symmetries whose actions on the 
spatial and body coordinates are locked to each other. The values shown in the last column 
assume that the subgroup of time-independent transformations in Gs.t., acting only on the spatial 
coordinates, is the Euclidean group ISO(d). The factor of R in Hy corresponds to time translations 


Material Emergent symmetry Gint Isotropy group Hint Unbroken subgroup Hy 
Solid Geryst x RI G cryst (Geryst x Rf )aiag xR 
Isotropic solid ISO(d) SO(d) [ISO(d)]diag x R 


Fluid SDiff(R2) SDiffo (R?) [ISO(d)]diag x R 
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(cf. Sects. 12.3.2 and 13.3.4), whose application to fluids would suggest an EFT 
with an infinite number of would-be NG fields; see [8] for details. 

Before we proceed to the construction of effective actions, let us check which of 
the symmetries in G actually are spontaneously broken. Suppose we are interested 
in the physics of fluctuations around a static equilibrium state, characterized by the 
time-independent vacuum expectation values (VEVs)” 


(Xi œ, Ð) = g (x). (14.11) 


Due to the product structure of G, an element (8gs.t., gint) E G transforms 
the spacetime and body coordinates as T, : (Xi, x#) > (XÏ, x”) = 
(Fi (X, gint), A“ (x, gs.t.)). Here the x”-independent functions Fİ realize the action 
of Gin on M, whereas the X ‘independent functions X” realize the action of 
Gs+, on the spacetime. The unbroken subgroup Hy consists of transformations 
preserving (14.11), 


F (g(x), hint) = p K(x, hst.)) , (hst., hint) € Hy - (14.12) 


This condition has the geometric meaning of equivariance of g' as a map M > M 
under the action of Hy, that is F og = go X. Since g'(x) is a diffeomorphism 
between the coordinate space and M, (14.12) defines a one-to-one correspondence 
between the maps Fi and X”. In this sense, Hg is the “diagonal subgroup” of G54. x 
Gint. It also includes those transformations from Gs, that act on x” trivially, that 
is time translations. An overview of the unbroken subgroups in solids and fluids is 
given in the last column of Table 14.1. 


14.2.2 Building Blocks for Construction of Effective Actions 


In the absence of other degrees of freedom, we are dealing with the d scalar fields 
Xİ (x).? All of these are of the NG type, realizing nonlinearly the emergent internal 
translations on M ~ Rf. Let us denote the generators of the translations as I. I will 
now finally fix the freedom to choose coordinates on M ~ Gint/ Hint X R? at will 
by parameterizing it as U(X) = exp(iX'TI;). This gives a precise definition of the 
previously mentioned “Cartesian coordinates” on M, in which the translations from 
Gint act on XÍ by trivial shifts, xi S xi +e, According to (13.24), the Maurer- 


? A different choice of the reference state of our classical medium might require a modification of 
the discussion below. The change would however not affect the general nonlinear realization setup, 
which only depends on the groups Gs.t., Hs.t., Gint and Hint. 

3 For relativistic systems, there is an alternative formulation of the EFT where the XŻ are completed 
to a Lorentz vector of dynamical variables. See Sect. 6 of [9] for an introduction and references. 
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Cartan (MC) form is then given by 
w(X, x) = MjdX' + P dx, (14.13) 


where P, generates spacetime translations. The spacetime coframe is trivial, that is, 
e*4 = §4dx”. Accordingly, it is not necessary to use different notations for frame 
and coordinate-basis indices. The covariant derivatives of the NG fields are simply 
Va X! = ðu Xİ. This makes it possible to use the same power counting as for 
superfluids (Sect. 13.2.2), whereby an n-th partial derivative of Xİ is assigned the 
counting degree n — 1. The LO of the EFT is thus defined by a Lagrangian density 
where every field X! carries exactly one derivative. 

Invariance of the effective action is ensured as follows. First, invariance under 
the whole internal symmetry Gint is guaranteed by using the MC form as a building 
block and imposing solely the linearly realized isotropy group Hint. Similarly, one 
has to impose explicitly invariance under the linearly realized spacetime isotropy 
group, Hs.. Finally, invariance under spacetime translations requires that the 
Lagrangian density does not depend explicitly on the spacetime coordinates. 


In case of fluids, the coset space Gint/Hint ~ SDiff(R?) jf SDiffo(R?) is 
not reductive, hence the line of reasoning using the transformation proper- 
ties (13.10) of the MC form does not necessarily apply. In such a situation, we 
may have to impose by hand invariance under the entire emergent symmetry 
group Gint. Luckily, we know a priori how VPDs act on the coordinates X i 
and thus also on the 1-forms dX‘ on M. This will make it possible for us to 
construct an EFT for fluids in Sect. 14.3.4. 


14.3 Effective Field Theory of Classical Matter 


I will now carry out the above-outlined program for several types of physical 
systems of interest. I will start with what in many ways is the simplest case: an 
isotropic relativistic solid. Having warmed up, we shall then have a look at the 
phenomenologically more relevant nonrelativistic solids. At the very end, we will 
return to the most nontrivial case of fluids with their infinite-dimensional symmetry 


group. 
14.3.1 Relativistic Solids 
Consider a system with relativistic microscopic dynamics that settles to a thermody- 


namic equilibrium state with the symmetries of an isotropic solid. The full symmetry 
is Gs.t. X Gint ~ ISO(d, 1) x ISO(d), where the first factor is the spacetime Poincaré 
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group and the second one is the emergent internal symmetry. With one derivative per 
field, the only way to obey the linearly realized Lorentz symmetry, Hs, ~ SO(d, 1), 
is to pairwise contract indices on 0, X i It follows that the LO effective Lagrangian 
is some function of the Lorentz-invariant matrix operator 


gi = 3 X Ot X! . (14.14) 


It remains to impose invariance under Hin ~ SO(d). Since BÏ isa symmetric 
tensor, any Hint-invariant function of S'/ can only depend on its d real eigenvalues. 
These can be encoded in any set of d algebraically independent invariants con- 
structed out of B}, for instance the traces of the first d powers of B°. The LO 
effective action for relativistic isotropic solids then reads 


SX) = f a?s Fe B(x), EGP, (ay!) +> f (14.15) 


where F is a function of the displayed arguments and the ellipsis indicates 
corrections of higher order in the derivative expansion. These include operators 
with on average more than one derivative per X’, and Lorentz and internal indices 
contracted in a way respecting the Hs.. x Hin ~ SO(d, 1) x SO(d) symmetry. 

With all the background we had built up, the path to (14.15) was very short. 
There are however a couple of potential loopholes to close. First, I tacitly assumed 
the effective Lagrangian to be strictly invariant under all the symmetries. Can there 
be any quasi-invariant contributions to the Lagrangian? Here I refer the reader 
to [10], which showed that the symmetries of a relativistic isotropic solid do not 
allow any genuinely quasi-invariant Lagrangians in d = 2 or 3 spatial dimensions. 
It is plausible to assume that the same conclusion holds for any d > 2. 

Second, we still need to check whether the tentative equilibrium (14.11) is a 
stable state of the EFT (14.15). To start with, note that the LO EoM of the EFT is 


aF l 
On ( arxi) =0. (14.16) 


iow 


In a uniform equilibrium, the VEV of the invariants (14.14), (B) = -Vø - Vot, 
should be coordinate-independent. The EoM then implies that the functions g'(x) 
must be harmonic, V7g! = 0. Acting with the Laplace operator on (EŻ), we 
get in turn that (3- Vİ) . (3V ot) = 0. Setting i = j herein shows that all 
partial derivatives of Vy! vanish. Hence, g'(x) is a linear function of coordinates, 
g (x) = M! xe +c! with a constant invertible matrix Mi , and constant c! . The set of 
constants cÍ can be removed by shifting the origin of coordinates. Finally, suppose 
that the generators I]; of internal translations were chosen as a basis of the standard 
vector representation of Hin ~ SO(d). By using the singular value decomposition 
augmented with appropriate orthogonal rotations of the fields X' and coordinates 
x”, the matrix Mi, can be made diagonal and positive-semidefinite. Invariance 
under the unbroken diagonal SO(d) rotations of spatial and body coordinates then 
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requires that M i, x ôi. Finally, the proportionality factor can be absorbed into 
a simultaneous rescaling of the body coordinates XÍ and the generators [;. We 
therefore conclude that, without loss of generality, the uniform equilibrium of the 
solid can be described by 


(XÏ (x, t)) = dix" . (14.17) 


Below, I will implicitly assume the same ground state for both nonrelativistic solids 
and (relativistic or nonrelativistic) fluids. 

Further constraints on the parameters of the effective Lagrangian arise from 
requiring that the state (14.17) is (meta)stable with respect to small fluctuations. 
To that end, we parameterize the fluctuations of X! by a set of NG fields, z’ (x) = 
Xİ (x) — Six". In terms of these fields, we have 


av = 8 — (8a nt +87 apni) + aurian) . (14.18) 


It is now a matter of straightforward algebra to expand the action (14.15) to second 
order in x’ and compute the excitation spectrum. The reader is most welcome to do 
this exercise, or check [11] where the special case of d = 3 spatial dimensions is 
analyzed. I will not work out the details here, since I will address the same problem 
in the arguably more realistic setting of nonrelativistic solids below. 


14.3.2 Nonrelativistic Supersolids 


Most naturally occurring solid materials lie safely within the nonrelativistic domain. 
It therefore appears more appropriate to demonstrate the utility of our EFT 
formalism in a setting that connects more directly to classical theory of elasticity [4]. 
However, in order that the discussion below is not a mere rehash of Sect. 14.3.1 
using nonrelativistic notation, I will add a new physical ingredient. Namely, 
some materials are known to enter at low temperatures a quantum phase called 
suggestively supersolid. In this phase, matter exhibits a combination of crystalline 
solid order and superflow. The latter stems from the presence of an internal, 
spontaneously broken U(1) symmetry. Once we have found an effective action for 
supersolids, we will be able to recover an EFT for ordinary solids by decoupling the 
superfluid NG boson. The content of this subsection is heavily influenced by [12]. 
The mathematical setup closely follows the discussion of nonrelativistic superflu- 
ids in Sect. 13.3.3, to which I refer the reader for details. In order to avoid excessive 
cross-references, I will however spell out the main features of the setup explicitly. 
The full symmetry of the nonrelativistic supersolid is given by the Bargmann group 
augmented with the emergent ISO(d) symmetry of a classical isotropic solid, that is 


G ~ SO(d) x {R} x [R? x U(1)ọ]} x ISO(d) . (14.19) 
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Here the SO(d) factor represents spatial rotations, R3 Galilei boosts, R? spacetime 
translations, and U(1)ọ the internal symmetry counting the number of particles. The 
symmetry group does not have the simple direct product structure Gs.t. X Gint. This 
is due to the dual nature of U(1) g, which is an internal symmetry but simultaneously 
centrally extends the spacetime Galilei group. We will however still be able to follow 
the philosophy of Sect. 14.2 with minor modifications. In particular the spacetime 
isotropy group Ho is in this case Hyp ~ [SO(d) x R4] x U(1)ọ x ISO(d). 

In order to be able to apply our basic EFT framework for spacetime symmetry, we 
need to realize the Galilei boosts nonlinearly. To that end, we need a Galilei vector 
order parameter A” = (A?, A), choosing a timelike reference point, Ab = (a, 0) 
with a # 0. This is accompanied by two other order parameters, a complex 
scalar y charged under U(1)g, and the body coordinates X i. Taking any Wo 4 0 
and X$ = 0 leads to Hyo, A0, X0),0) * SO(d) x SO(d), where the two factors act 
respectively on the spatial and body coordinates. The coset space relevant for the 
nonlinear realization of the symmetry is Ho/ A (yp, Ao, X0),0) S RI x U(I)ọ x R?. 
The last factor of R? is new compared to Sect. 13.3.3 and carries the solid degrees 
of freedom. The whole coset space is parameterized by the NG variables 7, €” and 
Xİ through 


U(x, &, X) = e" eK expGX'TI;) , (14.20) 


where Q is the generator of U(1)g and K” that of Galilei boosts. The MC form is 
calculated using the commutation relations of the Bargmann group, 


olr, E, X, x,t) = Olda —&- dx + (1/2)€*t] + K - dé + MjdXx! 
+ Hdt + P - (dx — &dt) . 


(14.21) 


The second line herein defines the spacetime coframe, e*2 = dr and e*” = 8£(dx* — 
&*dt). This allows one to extract the covariant derivatives of all the NG fields from 
the w part of the MC form, 


Vor = don + £ -Vr —&7/2, Va = ðr — & , 
Vie = (ðo + £ - Ve , Vé” = 3" , (14.22) 
VoXÍ = (ðo + £- V)Xİ , V-X! =ð X'. 


For simplicity of notation, I have already dropped the underscores distinguishing 
spacetime and frame indices. 

The would-be NG field £" (x) is unphysical and can be eliminated by imposing 
the inverse Higgs constraint (IHC) V,m = 0, or &(x) = Va(x). This assigns €” 
the counting degree zero. As a consequence, V,” has degree one, whereas both 
Var and V, X i are of degree zero. The building blocks we have to construct the LO 
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effective Lagrangian are therefore 


Vor — don + (Vx)?/2, VoX! > (9 + Va-V)X', V,Xİ > a,X!. 
(14.23) 


These take automatically care of the nonlinearly realized symmetries under Galilei 
boosts, U(1)g transformations and internal translations of X ‘To ensure invariance 
under spatial rotations, note that both Voz and VoX i are scalars, whereas V, X’ is a 
vector under spatial SO(d). All spatial indices thus have to be contracted into 


si = "y, XİV, Xİ = VXİ - VXİ . (14.24) 


This symmetric tensor is related to the previously defined matrix M 7 (x) = 0,X! (x) 
by Š = MM”. Note that there is no additional, algebraically independent rotation- 
ally invariant operator where the spatial indices on V, X’ would be contracted with 
the LC symbol, since €!” V, Xi... -V X" = eld det M = e't dy det È. 
Altogether, the LO effective action for nonrelativistic supersolids is given by [12] 


Sert, X} = fers F(VYor(x), VoX (x), E) +. (14.25) 


The indices on VoX! and Č} are to be contracted in a way that respects invariance 
under the internal SO(d) rotations. 

Again, I have not explicitly considered the possibility of quasi-invariant contri- 
butions to the Lagrangian. There are d + 1 such terms, constructed solely out of 
the superfluid mode z; cf. Sect. 13.3.3. However, these are only relevant beyond the 
LO of the derivative expansion. An additional quasi-invariant Lagrangian existing 
in d = 2 dimensions and including the solid variable X i was found in [10]. This 
likewise contributes only at higher orders of the derivative expansion. 


14.3.3 Nonrelativistic Solids 


Let us now see whether we can recover the physics of classical (isotropic) solids 
from the supersolid EFT (14.25). It appears we should be able to simply erase 
the NG field x(x). This certainly does not interfere with the nonlinear realization 
of the symmetries of solids on X', since that is entirely independent of x. The 
building blocks (14.23) then boil down to d9X i and 0,X‘. Invariance under spatial 
rotations again requires that 0, X' only enters the LO EFT through the combination 
Ši, However, the time derivative, 9X’, is potentially problematic since unlike 
VoX', it is not invariant under Galilei boosts. This actually makes sense: recall 
that the nonrelativistic kinetic term in (14.2) or (14.7) is only quasi-invariant under 
Galilei boosts. Guided by the analogy, the kinetic term for the solid should be 
(1/2)o(x, t)[x(X, t)]?. Here x(X, t) is the Lagrangian variable we worked with in 
Sect. 14.1 and ọ(x, t) is the local mass density of the solid in the laboratory frame. 
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Now ọ(x) = oo det M(x) = Qo,/ det E(x), where Qo is the mass density of the solid 
in the body coordinates, which is constant thanks to the assumed uniformity of the 
solid. Using finally (14.8), the LO effective action for the classical nonrelativistic 
isotropic solid can be written as 


Serf X} = / dx [È det LÉE jox ()90X/ (x) 


- FO} . 


(14.26) 


The function (=) defines the free energy of static elastic deformations of the 
solid. Our derivation of the kinetic term in (14.26) was based on an educated 
guess. It can however also be recovered by starting from an exactly Galilei-invariant 
operator in the EFT (14.25) for supersolids and adding a surface term, proportional 
to the gradient of x(x). This explains why setting z — 0 does not destroy Galilei 
invariance altogether but renders (14.26) quasi-invariant. See [12] for details. 

I will now review the basic elastic properties of isotropic solids, following 
closely [4]. As in Sect. 14.1.1, the displacement of the solid element carrying label 
Xİ from its equilibrium position is given by ¢' (X, t) = six" (X, t) — Xİ. This 
parameterization is however not suitable for an EFT where the scalar fields X i(x) 
are the dynamical variables. We need to invert the relation between x” and ¢', which 
is not possible in a closed form. Luckily, we will only need the deviation of XÍ from 
its VEV to first order in o F 


Xİ (x) = dix” — p (x) + O’) . (14.27) 
To the same order, the matrix variable EY reads 
BY (x) = 8 — eapi (x) + 5/7 ao (x)] +O’) . (14.28) 


With the shorthand notation ġ, (x) = bid! (x), small deformations of the solid are 
therefore encoded in the strain tensor 


1 
ers (x) = 5 lords (x) + dsr Œ]. (14.29) 


By its definition, ¢' (x) transforms as a vector under the unbroken diagonal SO(d) 
subgroup. The strain tensor therefore behaves as a rank-2 symmetric tensor. 

The energy cost of a small static elastic deformation of the solid is governed 
by the expansion of ¥(&) in powers of E} — 6’/, or directly the strain tensor. 
The expansion cannot contain a linear term, as that would immediately imply 
an instability of the equilibrium state where (¢/(x)) = 0. The physics of small 
elastic deformations is therefore dominated by the part of 2 (Č) quadratic in the 
strain tensor. Invariance under unbroken diagonal rotations admits two independent 
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quadratic operators at the lowest order in derivatives, 
T 2 À 2 
F = utre tore) +. (14.30) 


Here à, u are known as the Lamé coefficients, and the ellipsis denotes contributions 
of higher order in ¢’ or derivatives. It is convenient to rewrite (14.30) as 


2 
tr K 
# =ute(e-"21) $y hey f (14.31) 


where K = à + (2u /d). Importantly, there are configurations of the solid for which 
either of the two terms in (14.31) vanishes whereas the other is nonzero. The u- 
term is zero if and only if e,s is proportional to 6,5, that is for uniform dilatations or 
contractions of the solid. Such overall volume change is detected by tre = V - @, 
which measures the deviation of the Jacobian det M from unity. For this reason, the 
parameter K is usually called bulk modulus. On the other hand, for deformations of 
shape that do not affect the volume of the solid, the K-term in (14.31) vanishes. The 
energy of such deformations is measured by the shear modulus m. Since the two 
terms in (14.31) can be set to zero independently of each other, bulk stability of the 
solid requires that 


K>0, u>o0. (14.32) 


The constraints (14.32) may look trivial, but they have very observable conse- 
quences for small oscillations of the solid. To see this, it is sufficient to consider 
the part of (14.26), bilinear in the fields ø’ (x), 


À 
Lett = Sar dopr Bods = [nre + are? | +o (14.33) 
The corresponding EoM reads 


O0d5br ~ UVP, + (A+ WIV A, (14.34) 


where the symbol ~ indicates a linear approximation. This has plane-wave solutions 
of the type 


dy (x,t) = bye eel? , (14.35) 
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where by is an amplitude. Solutions for which $ || p and ĝ L p represent respecti- 
vely longitudinal and transverse sound. The corresponding phase velocities are 


+2 K +2u(1 —1/d 
na feel EAUS. iene (14.36) 
20 Q0 Q0 


The stability condition K > 0 implies a specific hierarchy between the velocities of 
longitudinal and transverse sound, 


we fy (1 _ 3) (14.37) 


UT 


This relation would have been completely invisible, had we focused solely on the 
spectrum of sound waves itself. When looking for the constraints on parameters of 
an EFT imposed by the stability requirement, it is therefore all-important to consider 
the whole configuration space of the EFT. 


14.3.4 Perfect Fluids 


As the last stop of our exploration of classical matter, we now return to fluids. I 
will restore full Poincaré invariance, partially because relativistic fluids are common 
in astrophysics and particle physics, and partially because it makes the analysis 
simpler. 

As argued in Sect. 14.3.1, the NG fields X‘(x) necessarily enter the effective 
Lagrangian through the combination E} = ðu X ‘g“XJ, However, not all actions 
of the solid type (14.15) are consistent with the required invariance of the fluid 
action under Gint ~ SDiff(R? ). The transition from solids to fluids can be viewed 
as a “fine-tuning” of the effective couplings of the former. There is only one 
algebraically independent function of BÙ that fits the bill, namely det E = |E]. 
Thus, the LO effective Lagrangian of a relativistic perfect (barotropic) fluid reads 


Sett{X} = f ars F(E W+. (14.38) 
With the help of the identity 3|E|/ðE" = |S|(8~');;, we readily extract the 
corresponding LO EoM, 

du[F’ (|S) |S] (S7"),0"X/] =0, (14.39) 


where the prime indicates a derivative of F'(|&|) with respect to its argument | &]. 
We could in principle stop here, for we have accomplished our goal to construct 
an EFT for perfect fluids. However, to shed light on the physical content of (14.38) 
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and (14.39) requires additional work. The first step is to derive the energy- 
momentum (EM) tensor of the EFT. This is a standard problem (see for instance 
Example 4.3),* the result being 


TY” = 2F'(\S|) (S| (€73 Xa XI — g” F(E). (14.40) 


This already tells us, upon some manipulation, that the EoM (14.39) is equivalent 
to the local conservation laws for energy and momentum. The next step is to trade 
the explicit dependence on the derivatives 3, X i for the local Eulerian velocity of 
the fluid. To that end, consider the current (14.9) with f(X) = 1, 


1 i l 
Je = JË = e eiin AX e yX. (14.41) 


It is a matter of straightforward algebra to verify that J” J, = (—1)4 |S]. The extra 
sign compensates for our set of conventions, in which (E) = —8™ but (J) = 6”. 
The spacetime vector 


JE 
ut = —> (14.42) 


v(-D418| 


is then normalized to unity, u? = 1. Moreover, according to (14.10), its spatial part 
is proportional to the local Eulerian velocity x of the fluid. It follows that u” is the 
velocity spacetime vector as observed in the laboratory frame. 

Consider now the symmetric tensor Guy = (S74); ðu Xid XI + Uj,Uy. Since 
ðu X ‘ as a spacetime vector is orthogonal to J“ and thus u“, it follows that G uvu” = 
u,,. At the same time, it is easy to see that Guð” X! = 0,X'. Together, ə” xX! 
fori = 1,...,d and u” constitute a basis of D linearly independent vectors on 
the spacetime, hence Guy A” = Ay = gy A” for any spacetime vector A”. This 
implies the useful identity 


(B71) 3p X OXF = guy — Upwey , (14.43) 

which allows us to rewrite the EM tensor (14.40) as 
TY = 2F' (E) [E] (gt? — uu”) — gt F((E)) . (14.44) 
In an inertial reference frame locally comoving with the fluid (local rest frame), the 
EM tensor should be diagonal and isotropic, T#” = diag(U, P,..., P). Here U is 


the local energy density and P the pressure of the fluid. This can be written in a 
covariant form as T#” = (U + P)u”u” — Pg”. The scalar functions U and P 


4I use an opposite overall sign of the EM tensor as compared to Example 4.3 to ensure that the 
T° component coincides with the canonical Hamiltonian of the EFT. 
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can be projected out of the EM tensor using T’’u,,uy = U and Ty =U —dP.It 
follows that the as yet unknown function F (| E|) is related to the energy and pressure 
by 


U(|S|) = —F(\S)) , P(E) = F((8) — 2F'((8) 8] . (14.45) 


To check that we have got the basic physics right, let us use our EFT to calculate the 
speed of sound in the fluid. For that, we need to find the bilinear part of the effective 
Lagrangian in the fluctuations x’ (x) = Xİ (x) — six" . I will outline the main steps 
but skip straightforward details. First, the fluctuation of E} itself is given by (14.18), 


sv = sh aE, 8EY = —(6''0,0/45/"5,0')+d,0'd4"a/. (14.46) 


Using the identity det & = exp tr log &, we next obtain the expansion 
1 
(-1)4 |S) =1-—t8e+ s[e? — tr(8E?)] + OSE?) . (14.47) 
What remains to be done is just a Taylor expansion of the Lagrangian, Yor = 
F(|S|), in’. Dropping a constant and a surface term, we find 


Lat ~ —(—1)4 FA (Bon)? + (D F} + 2FV MV ox)? H, (14.48) 


where Fo and Fy are derivatives of F (|E|) taken in the equilibrium, (| €|} = (=1)4, 
Note that (14.48) does not contain any transverse gradients of 2‘(x). This is 
expected, saying merely that transverse fluctuations do not propagate via harmonic 
oscillations due to the absence of shear elastic forces in a fluid. Fluids support only 
longitudinal sound waves, whose phase velocity follows as 


F! 
vwu = | 1+2). (14.49) 
Fo 


It is easy to see with the help of (14.45) that this equals y P’(|S|)/U’(|S]) = 
/dP/dU, as expected for thermodynamic sound. 


Another informative check of our EFT is to take the nonrelativistic limit. To that 
end, we use the relation (14.45) between the function F (|2|) and the energy density 
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U(|S|) to decompose the former as 


F(E) = —aoV (-1)4 || — F (Df |El) . (14.50) 


The constant @9 is the mass density of the fluid in the body coordinates. In the 


local rest frame, ooy (—1)4|E| > oo where 5/ is defined by (14.24), is the 


density of the rest mass of the fluid. This is expected to dominate the energy in 
the nonrelativistic limit. On the other hand, F (=D? 1E) —> F (|Ž|) becomes 
the elastic free energy of the fluid. Its contribution to F(|E]|) is assumed to be 
suppressed relatively to the leading term in (14.50) by two inverse powers of the 
speed of light. 

In the laboratory frame, we write E} = a)X'd)X/ — Š and again use the 
identity det & = exp tr log &. This gives 


T 
o) 


(CD? |El = ŽI [1 — (871); 0X! aX! +], (14.51) 


where the ellipsis stands for higher-order corrections, negligible in the nonrelativis- 
tic limit. In turn, (14.45) becomes 


a C0 Fea i j A 
U = ooy|8| — Z VEET oX oX + FEN, 
2 (14.52) 
P = -F (Š) +2F' (ŠD + 


This has a simple interpretation. The three contributions to U correspond, up to 
a sign, to the relativistic rest energy, the nonrelativistic kinetic energy and the 
thermodynamic (internal) energy, while P is the nonrelativistic pressure. The same 
expansion converts the action (14.38) to 


SalX) = f ax { ~ eo VEO EVEI yo oX 
- Fa} +. (14.53) 


This copies the EFT (14.26) for nonrelativistic solids that we previously obtained 
by an educated guess. The only difference is that the gradient free energy F is 


now allowed to depend only on | S|. (The go 


žo) term gives upon integration a 


constant and can be dropped.) The EFT (14.53) for nonrelativistic fluids inherits the 
symmetry under VPDs of the target space M from its relativistic ancestor (14.38). 
The spacetime part of its symmetry is however different, and includes spacetime 
translations, spatial rotations, and Galilei boosts. 
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The symmetry of fluids under VPDs, Gin ~ SDiff(R? ), implies the existence 
of infinitely many conserved currents. Indeed, the action of an infinitesimal VPD 
can be expressed as 8X! = eV'(X), where € is a parameter and Vi(X) any 
smooth vector field on M ~ R? with vanishing divergence, V'(X)/dX' = 0. 
A straightforward application of Noether’s theorem gives the current 


JG = F'((8) |S] (87) ja" X! Vİ (X) . (14.54) 


Its on-shell conservation is also seen as a direct consequence of the EoM (14.39). 
The conservation of the corresponding integral charges generalizes the so-called 
Kelvin circulation theorem to relativistic fluids; see [13] for further details. 


14.4 Coupling Nambu-Goldstone Bosons to Classical Matter 


Before we wrap up the discussion of EFTs for classical matter, let us step 
back to see how it connects to the rest of the book. We started the analysis of 
spontaneously broken spacetime symmetries with a fairly general construction of 
nonlinear realization thereof in Chap. 12. However, in its subsequent applications, 
we gradually increased the assumptions on the symmetry and the order parameter. 
In the present chapter, we ended up discarding altogether the possible presence of 
NG modes associated with long-range order in the quantum ground state. Here 
it is possible to tie up some loose ends with little effort. An attentive reader 
might have noticed two related observations I made in quite different contexts. In 
Sect. 13.3.3, I pointed out that it might be possible to make an EFT with Aristotelian 
symmetry invariant under boosts, whether Galilei or Lorentz. All one needs is 
an auxiliary variable representing the velocity of the medium in which the EFT 
lives. On the other hand, we learned in Sect. 14.1.2 that this velocity is naturally 
encoded in the identically conserved current (14.9). Following this link, I will now 
sketch concretely how an Aristotelian EFT for broken internal symmetry can be 
made boost-invariant by coupling the EFT to a classical medium. A general EFT 
framework for such hybrid systems does not seem to exist as yet. For the sake of 
illustration, I will restrict the discussion to systems where the classical medium is a 
fluid. The construction is loosely inspired by Pavaskar et al. [14]. 

Instead of lengthy reminders, I refer the reader to Chap.8 for the details 
of construction of EFTs for broken internal symmetries. (See Sect. 8.1.4 for an 
executive summary.) All these EFTs were built assuming Aristotelian symmetry, 
that is symmetry under spacetime translations and spatial rotations. There are two 
details we have to attend to if we want to augment this spacetime symmetry with 
boosts. First, the temporal and spatial derivatives, which appear independently in 
the general Aristotelian effective Lagrangian (8.33), have to be combined in a 
way that respects boost invariance. Second, any operator invariant under all the 
spacetime and internal symmetries can be multiplied with an arbitrary function of 
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| S| (for relativistic fluids) or || (for nonrelativistic fluids). From this point on, it is 
convenient to split the discussion of the two cases. 

Suppose we want to make the Aristotelian EFT invariant under Galilei boosts. We 
need not do anything about spatial derivatives, which are already Galilei-invariant on 
their own. Temporal derivatives can be fixed by contraction with the current (14.41). 
In terms of the body coordinates X’ and the velocity x of the medium, this amounts 
to replacing the time derivative of the NG field z“ of the broken internal symmetry 
with J“ 3 r" = det M (dox! +x-V x1) = det M(Vozx“). The minimal modification 
of the two-derivative part of the effective Lagrangian (8.33) then reads 


1 x 
LEO = — Skea El eog(or)oof Va Vr? 
1 ss 
— shea ll El)org rors ye" Ar dsr” , (14.55) 
1 = A 
Lop? = zeal EDO) Vor "Vor? . 


The two factors of det M coming from the replacement d97° —> Jd,“ are 
without loss of generality absorbed into Kap (|&|). To preserve the internal symmetry 
acting on “, the coupling functions kgp (| El), Kab(|Ž[) and Ago(| El) must satisfy 
the constraints (8.34) and (8.35) for all values of X‘(x). The precise dependence of 
the coupling functions on || can be fixed by measuring the effective couplings Kap, 
Kab and Agp as a function of the density of the underlying medium. 

The only nontrivial bit of the “Galileanization” of the EFT (8.33) lies in the part 
of the Lagrangian with a single time derivative, go In case this is also strictly 
invariant under the internal symmetry, it can be promoted to 


0,1 a 
LOY = —o4 (|B)? (x) Vor" , (14.56) 
where o,(| SD) is subject to the condition flol īp = 0; cf. (8.39). However, in 
case Lr is merely quasi-invariant, we cannot multiply it by an arbitrary function 
of | E|. The only way out then seems to be to set 


LOY = ca) J aun" , (14.57) 
where the 1-form c(z7) = ca(m)dz® is constrained by the required strict invariance 
of its exterior derivative, dc(z). This makes invariance under Galilei boosts mani- 
fest, while quasi-invariance under the internal symmetry acting on 7“ is preserved 
thanks to the identical conservation of J”. Note that in this special case, imposing 
Galilei invariance produces a unique coupling between the NG fields x° and the 
matter variables X', without any new a priori unknown parameters. Mathematically, 
the Lagrangian (14.57) descends from the (d + 1)-form c(zr) A dX! A+++A dX? on 
the coset space of broken internal symmetry. 
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Example 14.4 


Our main example of a system with a ae Lagrangian have been ferromagnets 
(see Sect. 9.2). While most natural ferromagnets are metals, there are also ferromag- 
netic materials that are crystalline insulators. The low-energy EFT of such materials 
brings together ferromagnetic magnons and the phonons of the crystal lattice. In 
this case, the dependence of the two-derivative Lagrangian on the tensor =! is 
more complicated than in (14.55). The reason for this is the reduced emergent 
symmetry of crystals as compared to fluids; see [14] for further details. However, the 
one-derivative part of the Lagrangian is still (14.57), generating a unique coupling 
between magnons and phonons. 


Let us finally see how the above argument needs to be modified, should 
we replace the requirement of Galilei invariance with the relativistic Poincaré 
invariance. In this case, the two-derivative Lagrangian may only feature properly 
contracted Lorentz indices. Embedding the antisymmetric 4,,-term into a Lorentz- 
invariant operator is possible by replacing 0,740.1? > eH J ður yT". 
Analogously, the spatial gradient operator Vi“ - Vat? should be embedded in 
— ga wan? . In order to maintain the form of the two-derivative Lagrangian in 
the local rest frame of the underlying medium, it is then convenient to write it as 


1 
LEO = Kall ENO og ður An” 


1 
- xed ( Blot (rol re” aur n? , (14.58) 


Log? = slRea(l8D — Kea (EDIO; og I By) (J" 0°) . 

This naturally incorporates the possibility for type-A NG bosons to propagate with 
a velocity unrelated to the speed of light while maintaining full Poincaré invariance. 
The coupling functions kgp(| S|), Kab(| E|) and Agp(|S]) are still subject to the 
constraints (8.34) and (8.35) for all values of X’ (x). The one-derivative Lagrangian 
a , should it be strictly invariant, can be coupled to the medium by a slight 
modification of (14.56), 


LED = opl EDot) J aun" . (14.59) 


When A is merely quasi-invariant, it can still be coupled to the classical 
medium via (14.57). Namely, (14.57) in fact respects the infinite-dimensional group 
of transformations preserving the spacetime volume form thanks to its origin in the 
(d+ 1)-form e(r) ^A dX! A---AdX@. This group includes both Galilei and Lorentz 
boosts. 
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Part V 


Epilogue 


®) 


Check for 
updates 


The story of spontaneous symmetry breaking (SSB) and its effective field theory 
(EFT) description does not end here. There are several further facets of SSB that 
would have deserved place in the table of contents of the book. However, giving 
them proper credit would either increase the volume of the book beyond reasonable 
limits, or require a substantial amount of additional background. In this chapter, I 
will give a brief primer on some of these exciting advanced aspects of SSB. I will 
be able to go to detail where it is feasible based on the material covered elsewhere 
in the book. For topics that depart substantially from the main text, I will resort to a 
few basic comments augmented with references for further reading. 


15.1 Effects of Nonzero Temperature 


The great majority of Parts III and IV the book is restricted to the leading order 
of the derivative expansion of the EFT for Nambu—Goldstone (NG) bosons. This is 
just the classical approximation to the EFT, based on the leading-order effective 
Lagrangian and using only tree-level Feynman diagrams. At a few exceptional 
spots, such as the proof of the Goldstone theorem in Sect.6.2 or of the Adler 
zero property in Sect. 10.1, the presented argument is clearly valid beyond the 
classical approximation. However, I have not discussed explicitly perturbative 
loop corrections except for an outline of their role in the derivative expansion; 
cf. Sects. 9.1.1 and 9.2.1. 

The effects of nonzero temperature have thus fallen through the cracks together 
with other loop corrections. Namely, in the so-called imaginary time formalism, 
the temperature of a system in thermodynamic equilibrium enters through discrete 
sums over Matsubara frequencies in loop diagrams. This is a standard part of 
field theory and I will therefore not dwell on details. A reader interested in the 
application of thermal perturbation theory to EFT for NG bosons will find more 
information for instance in [1] (chiral perturbation theory of mesons) or [2] (EFT for 
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ferromagnetic magnons). One generic aspect of SSB worth mentioning explicitly is 
that the order parameter tends to be reduced by thermal fluctuations. As a rule, the 
phase where a symmetry is spontaneously broken will persist only up to certain 
critical temperature, above which the symmetry is “restored.” This is of course 
mere jargon. The symmetry of a system constrains its dynamics at all temperatures. 
However, above the critical temperature, the order parameter for SSB vanishes and 
excitations in the spectrum are organized in multiplets of the full symmetry group. 
See Chap. 7 of [3] for a discussion of symmetry restoration within thermal field 
theory. 

What are the thermal effects on the spectrum of NG bosons? First of all, the 
Goldstone theorem remains valid at nonzero temperature as long as the symmetry 
is spontaneously broken. See Chap. 26 of [4] for a proof of the existence of a 
stable gapless quasiparticle, the NG boson, at nonzero temperature. Likewise, the 
distinction between type-A and type-B NG bosons survives at nonzero temperature. 
A detailed investigation of the thermal spectrum of NG bosons was carried out 
in [5]. At nonzero temperature, quasiparticles manifest themselves by poles in the 
propagator in the lower half-plane of complex energy. The two types of NG modes 
differ in the way their thermal width scales with momentum in the long-wavelength 
limit. As a rule, the complex energy of a type-A NG boson is thus schematically 
E(p) « |p| —ip?, whereas that of a type-B NG boson is E(p) x p? —ip*. In both 
cases, the ratio of thermal width and the real part of the energy tends to zero in the 
limit p — 0. This ensures the stability of the NG modes. 

The imaginary time formalism is only suitable for describing matter in thermo- 
dynamic equilibrium. In the past couple of decades, much progress has been made 
in the development of quantum-field-theoretic methods for thermodynamic systems 
out of equilibrium. The extension of these techniques to EFTs based on nonlinear 
realization of symmetry is fairly recent. The reader will find further details in [6-8]. 


15.2 No-Go Theorems for Spontaneous Symmetry Breaking 


The entire book is based on the assumption that the symmetry of a given system is 
spontaneously broken. It is however equally interesting to try to understand under 
what circumstances SSB may in fact occur. The most striking results in this regard 
form a collection of “no-go” theorems, forbidding SSB in a specific class of theories. 
What follows below is, to the best of my knowledge, a representative list. The results 
included differ somewhat in the level of rigor at which they have been proven, and 
in the mathematical techniques required to establish them. Having just discussed the 
effect of loop corrections, it is natural to start with several related statements where 
the fluctuations of the order parameter play a key role. 


Coleman Theorem In Lorentz-invariant systems, NG bosons necessarily have a 
relativistic dispersion relation, E(p) = |p|. It is easy to see that the corresponding 
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Table 15.1 Subdivision of NG bosons into classes based on the noninteracting part of the effective 
Lagrangian. The latter determines the form of the (inverse) propagator, which in turn gives 
the dispersion relation of the NG mode. The last two columns of the table indicate spacetime 
dimensions, allowed by the condition that the coordinate-space propagator does not diverge at long 
distances 


Type | Inverse propagator | Dispersion relation | Dimension(T = 0) | Dimension (T > 0) 
An |E- p” E(p) x |p|” D>n+2 D>2n+2 
Bon E — p” E(p)« Ipi?” any D> 2 D>2n+2 


free propagator in coordinate space, 


d?p eo ip x 
15.1 
Qr)? p? mn 
is infrared-divergent for D = 2 spacetime dimensions. This translates into a 


divergence of the two-point correlation function of the order parameter at long 
distances. In other words, the fluctuations arising from the NG boson destroy 
the assumed order parameter. This is the essence of Coleman’s theorem [9], 
which forbids spontaneous breaking of a continuous symmetry in two-dimensional 
relativistic systems. 

This argument lends itself to a broad generalization. Consider a generic effective 
Lagrangian whose bilinear part starts at order 2n in spatial derivatives. While the 
n = l option is most natural, we also saw in Sect. 13.3.2 an example of a system 
where n = 2, at least in some spatial directions. More generally, the lack of 
low-derivative contributions to the spatial part of the kinetic term can be naturally 
explained by the presence of a coordinate-dependent symmetry [10, 11]. As to the 
part of the effective Lagrangian carrying time derivatives, we expect either one 
or two derivatives respectively for type-A and type-B NG bosons. This leads to 
a refined classification of NG bosons as type-A, or type-B2n. See the first three 
columns of Table 15.1 for an overview of the schematic forms of the corresponding 
free-particle propagators and dispersion relations. A simple modification of the 
argument leading to Coleman’s theorem now shows that at zero temperature, type- 
An NG bosons are forbidden in D < n + 1 spacetime dimensions. Intriguingly, no 
such a constraint exists for type-B2, NG bosons. In order to have any spectrum of 
quasiparticles, at least one spatial dimension must of course be present. However, 
for any D > 2, the propagator of a type-B2, boson at zero temperature is infrared- 
finite. See the fourth column of Table 15.1 for a summary. 


Hohenberg—Mermin—Wagner Theorem At nonzero temperature, the situation 
changes dramatically. Here the Minkowski-spacetime integral (15.1) is replaced 
with an imaginary-time sum-integral of the schematic type 


+00 d : $ 

d iagt +ip- 

T> / = aa = kila 2 (15.2) 
pore) CT) Gok or wg) + p” 
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where wy = 2kxT is a bosonic Matsubara frequency and T the temperature. 
The type-A, and type-B2, cases correspond respectively to iw, and wr in the 
denominator. Due to the discrete nature of the Matsubara sum, the contributions of 
all terms with k Æ 0 are infrared-finite. An infrared divergence can only arise from 
the zero Matsubara mode, and will be absent provided d > 2n+1 or D > 2n+2 [12] 
(the last column of Table 15.1). While the critical dimension is the same for type-A,, 
and type-B2, NG bosons, note that n here counts the number of spatial derivatives in 
the Lagrangian. For fixed power of momentum in the dispersion relation, the critical 
dimension for type-A2,, NG bosons is higher than that for type-B2,, ones. 


The original results on the absence of SSB at nonzero temperature are due to 
Hohenberg [13] (for superfluids) and Mermin and Wagner [14] (for isotropic 
lattice models of ferro- and antiferromagnets). These covered implicitly the 
cases of type-A; and type-B2 NG bosons. However, their approach was 
different from the above intuitive argument and did not rely on the propagator 
of the would-be NG mode. Rather, they used the so-called Bogoliubov 
inequality to place an upper bound on the order parameter. This bound shows 
that the order parameter goes to zero in the limit of infinite volume and 
vanishing symmetry-breaking perturbation. See [15] for a review and further 
references. 


Landau-Peierls Instability There is another related result that applies to systems 
where the order parameter is spatially modulated in one direction, thus sponta- 
neously breaking translations. This is the case for instance for liquid crystals in 
the smectic-A phase, as we saw in Sect. 13.3.2. Let us briefly recall the line of 
reasoning therein to stress its generality. Suppose that the order parameter is given 
by a scalar field ¢ that develops a nonzero gradient in the ground state. We can treat 
(V@) as a secondary, vector order parameter. The assumption that translations are 
spontaneously broken only in one dimension amounts to the condition that (V œ) 
points in the same direction everywhere in space. Next, we expand ¢@ around its 
expectation value as @ = (f¢) + x and focus on the part of the effective Lagrangian 
bilinear in the fluctuation x. The assumed (and also spontaneously broken) rotation 
invariance dictates that there cannot be any term in the Lagrangian, bilinear in 
the part of Vz, perpendicular to (Vø). The gradient expansion of the Lagrangian 
starts with terms proportional to (Vy)? and (Vir), where || and L denote 
projections to subspaces parallel and perpendicular to (V). The denominator of 
the large fraction in (15.2) should then be replaced with (iw, or wz) + Pi + p$. 
Integrating the k = 0 term over py shows that the order parameter is washed out 
by thermal fluctuations at any nonzero temperature whenever d < 3. This is known 
as the Landau-—Peierls instability, see Sect. 1.6 of [16] and [17] for a more detailed 
discussion within the condensed-matter and nuclear physics context, respectively. 
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Absence of Time Crystals Speaking of spontaneously breaking translations in 
a single direction, it is mandatory to consider the possibility that this direction 
corresponds to time. The idea that time translations could be spontaneously broken 
was proposed in [18, 19]. Such systems have since been known as time crystals. 
The reason why this possibility had not been noticed until the twenty-first century 
is that time crystals are not easy to realize. It was shown soon after the original 
proposal that their existence is forbidden in thermodynamic equilibrium of any 
Hamiltonian with sufficiently short-range interactions [20, 21]. There is however 
a nonequilibrium route towards quantum time crystals; see [22] for a recent review 
of the subject. 


Vafa—Witten Theorem Finally, the list of no-go theorems would not be complete 
without the Vafa—Witten theorem [23]. This is of quite a different nature than all 
the other results above, and applies to Lorentz-invariant gauge theories coupled to 
fermionic matter in a vector-like manner. An important example of such a theory 
is the quantum chromodynamics (QCD). Suppose the theory possesses a vector-like 
symmetry, that is one that acts in the same way on left- and right-handed fermions. In 
QCD, this could be for instance the U(1)g baryon number symmetry or, in the limit 
of equal quark masses, the SU(2)y isospin symmetry. The theorem states that such 
vector-like symmetries cannot be spontaneously broken in the ground state. The 
proof is technical. See also [24] for a review of the relevant mathematical methods 
and their applications to QCD and hadron physics. 


15.3 Topological Aspects of Spontaneous Symmetry Breaking 


We already encountered various topological aspects of EFTs for NG bosons 
on several occasions. Below, I will generalize some of the observations made 
previously, and add new interesting physics. 


Topological Defects and Solitons The classification of topological defects was 
the first major application of topology to physics; see [25] for an introduction to 
the relevant mathematics. Recall that the coset space G/H can be viewed as the 
vacuum manifold of a theory with SSB, whose points indicate possible values of 
the order parameter in the ground state. A defect is a nonuniform configuration of 
the G/H-valued order parameter that is singular on some subdimensional domain 
in space. See Fig. 15.1 for the simple examples of a point and line defect. The 
presence of a defect can be established from the properties of the order parameter 
away from the singularity. Thus, a p-dimensional defect in R? can be enclosed by 
a (d — p — 1)-dimensional hypersurface Xg—p_—1 that is topologically a sphere, 
S4¢-P—!, Equivalence classes of defects can then be determined by studying maps 
S¢-P-! _, G/H, which are classified by the homotopy group Ta—p—1(G/H). This 
assigns the defect a unique label: its topological charge. An exception are domain 
walls, which are of codimension one in space, dividing it into two halves. A domain 
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Fig. 15.1 Examples of 

topological defects ind = 3 

spatial dimensions together 

with the surfaces defining 

their topological charge. The < Sie 
left panel shows a point > 
defect (monopole), the right l 
panel a line defect (vortex) 22 | 


Table 15.2 Classification of Defect Dimension p | Classifying group 
basic topological defects by = a a 
‘ ; ik Domain wall | d — 1 mo(G/H) x m9(G/H) 
their codimension in space 
Rd Vortex d-2 mwi(G/H) 
Monopole d—-3 m2(G/H) 


wall is thus characterized by the values of the order parameter on both sides. See 
Table 15.2 for an overview of basic types of topological defects. 


We saw examples of domain walls in Sect. 13.4. The double-well potential (13.78) 
has a discrete G ~ Z2 symmetry under 6 — —d@. This is broken in its minima 
to H ~ {e}, hence G/H ~ Zp and likewise mo(G/H) ~ Z2. The cosine 
potential (13.80), on the other hand, has the symmetry group G ~ Z2 x Z. The 
Z2 factor is generated by the inversion ¢ — —@ whereas the Z factor by the shift 
ọ — + 2xv. In any of the minima of the potential, the symmetry is broken 
to a subgroup isomorphic to H ~ Z2. Therefore, in this case G/H ~ Z and 
accordingly m9(G/H) ~ Z. The elements of mo(G/H) x mo(G/H) shown in 
Table 15.2 correspond to domain wall solutions interpolating between different pairs 
of minima. 


The focus on singular field configurations may be surprising; the physics should 
not change if we “smooth down” the order parameter. This is however only 
possible at the cost of embedding the coset space G/H in a larger order parameter 
manifold M. 


Consider a superfluid in d = 2 dimensions. Here the order parameter (yr) takes 
values from C, while the vacuum manifold is G/H ~ U(1)/{e} ~ S! for any (Y) Æ 
0. A point defect (vortex) in the superfluid is described by a smooth complex field 
w(x). The corresponding topological charge is the winding number, defining an 
element of 71 (G/H) ~ 1(S') ~ Z; see also Example 12.4. Importantly, nonzero 
winding number implies by the argument principle (Chap. 7 of [26]) the existence 


15.3 Topological Aspects of Spontaneous Symmetry Breaking 367 


of a point x € R* where w(x) = 0: the core of the vortex. This is our “singularity” 
where the order parameter cannot lie on the vacuum manifold G/H ~ S!. 

For another example, consider a spin system in d = 3 dimensions with G/H ~ 
S?. Suppose that on the sphere £z as shown in Fig. 15.1, the order parameter 
represented by a unit vector n € S$? points radially outwards everywhere. This 
spin configuration is sometimes called hedgehog. If we now try to continue the 
field to the inside of the sphere, we will inevitably encounter a point singularity: a 
monopole. Possible values of its topological charge are classified by 72(G/H) ~ 
m2(S*) > Z. The singularity can be avoided only at the cost of deforming the spin 
configuration out of the coset space G/H. The natural way to do so is by embedding 
G/H ~ S? in R? and allowing n to vary its magnitude. One can then extend the 
hedgehog field on X} to the inside by scaling down its magnitude so that it vanishes 
at the location of the monopole. 


In contrast to defects are topological solitons (Sect. IX of [25]). These are 
fields in R? that are smooth and take values from G/H everywhere. Nontrivial 
topology arises from imposing a specific boundary condition at spatial infinity. 
Thus, the field configuration of a p-dimensional soliton is required to converge to 
a constant far from a fixed p-dimensional hypersurface Cp in R¢. Now consider 
an open hypersurface Xqg—p that is “transverse” to Cp. The boundary condition 
effectively compactifies this hypersurface to a sphere, S’~?. Equivalence classes 
of p-dimensional solitons in R? are therefore classified by the homotopy group 
Ta—p(G/H). 


Example 15.3 


The simplest type of a topological soliton is that with p = 0. Such solitons can be 
viewed as quasiparticles localized in all directions in space, and typically carry a 
finite amount of energy. The transverse hypersurface Xg is in this case the entire 
space R?. We already met two examples of such solitons in Chap. 9: skyrmions in 
QCD, corresponding to d = 3 and 13(G/H) ~ 13(S°) ~ Z, and baby skyrmions in 
two-dimensional ferromagnets, for which 72(G/H) ~ m2(S*) ~ Z. The latter are 
easily generalized to higher dimensions, simply by making the fields independent 
of whatever extra coordinates are present. Thus a skyrmion in a three-dimensional 
ferromagnet is localized to a line, C1. Its topological charge is determined by the 
properties of a unit vector field on a two-dimensional surface X2, transverse to C1. 


Topological Terms in the Action We saw in Sect. 8.1 that the part of the effective 
Lagrangian for NG bosons with a single time derivative may be topologically 
nontrivial. Similarly, it turned out in Sect. 9.1 that the part of the low-energy EFT 
of QCD, taking into account the chiral anomaly, is topological in nature. We shall 
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now generalize these observations, thus uncovering a broad class of contributions to 
EFTs for NG bosons, arising from the topology of the coset space G/H. 

Suppose that G/H is compact and consider a closed p-form w?) that belongs to 
a nontrivial cohomology class in the p-th de Rham cohomology group, H?(G/H).! 
Upon pulling w”) back to a spacetime M of dimension D by the map M > G/H 
defining the NG fields, it becomes a likewise closed p-form on M. There are now 
three different possibilities depending on the relation between p and D. If p = D, 
the spacetime p-form can be directly integrated and added to the action of the EFT. 
This is called a 6-term. Due to its origin in a closed D-form, it is a mere surface 
term in the Lagrangian and thus does not affect the perturbative dynamics of the 
EFT. This is the reason why @-terms have not featured in the book at all. If present, 
they contribute a phase factor to the generating functional though, which may affect 
the nonperturbative physics of the theory. 

Another relevant possibility is p = D + 1. In this case, there is a locally well- 
defined D-form œP? on G/H such that dw) = wP). Upon pulling back to 
the spacetime, œP? gives a contribution to the Lagrangian that is quasi-invariant 
under the action of G. This is a Wess—Zumino (WZ) term. The anomalous contri- 
bution to the low-energy effective Lagrangian of QCD is obviously of this kind; 
cf. Sect. 9.1.4. However, the single-time-derivative part of the effective Lagrangian, 
wo , constructed in Sect. 8.1.3, belongs to the same category. Here only the 
variation of the NG fields in time matters and thus effectively D = 1, which makes 
H?(G/H) the relevant de Rham cohomology group. Following the line of reasoning 
of Sect. 9.2.4, we see moreover that such WZ terms generally give rise to a Berry 
phase when the ground state is driven by an external field [27]. See Table 15.3 for 
a summary. Further technical details on the construction of topological WZ and 6- 
terms in the action can be found in the recent study [28]. For a somewhat more 
condensed-matter oriented perspective on topological terms, see Chap. 9 of [29]. 


Table 15.3 Physical interpretation of generators of the de Rham cohomology group H? (G/H), 
depending on the dimension D of spacetime they are pulled back to. For p < D, the pull-back 
leads to a closed p-form on the spacetime, Hodge-dual to an identically conserved tensor current 
of rank D — p. In the table, the rank of currents J“”" is indicated by the number of indices 


Dimension p=1 p=2 p=3 p=4 p=5 
D=1 6-term WZ term - = = 

D=2 Jt 6-term WZ term - - 

D=3 ju Jt 6-term WZ term - 

D=4 piv JP? Jt 6-term WZ term 


1 I follow the convention common in high-energy physics and indicate the degree of a differential 
form with a superscript in parentheses. 
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15.4 Generalized Symmetries 


It remains to clarify the content of the lower left corner of Table 15.3. For p < D, 
the spacetime p-form arising from pulling back wP? can be expressed as the Hodge 
dual of a (D — p)-form, x J‘?~?). The form JPP? has a vanishing codifferential; 
cf. Appendix A.6.5. In a flat spacetime, this can be expressed component-wise as 


aust" =0, (15.3) 


where J”#™" is fully antisymmetric in all its indices. Equation (15.3) is nothing 
but a local conservation law with a tensor current of rank D — p. It encodes, on 
the classical level, a generalized symmetry. This (and other) kind of generalization 
of conservation laws has attracted much attention in the last decade. An interested 
reader will find a comprehensive introduction for instance in [30-32]. 

In the context of EFT for NG bosons, the identically conserved currents J#”” 
give us a tool for calculation of topological charges of defects and solitons, classified 
in Sect. 15.3. Mathematically, this is based on the correspondence between the 
homotopy and de Rham cohomology groups of G/H via the Hurewicz theorem (see 
Appendix A.8.2). Physically, the conservation law (15.3) suggests the existence of 
a conserved charge, obtained by integrating the temporal component, J0”, over 
a spatial hypersurface. For a static defect or soliton, it is easiest to pull œP? back 
directly to the space R? and express it therein as x J~”). The (d — p)-form JETP? 
is the density of our topological charge. The topological charge itself is obtained by 
integration over an effectively closed p-dimensional surface Xp in RI, 


Q(X») = / IEP | (15.4) 
Xp 


The fact that Xp is closed guarantees that the value of Q (£ p) does not change under 
smooth deformations of the surface. Depending on circumstances, (15.4) represents 
the topological charge of a (d — p — 1)-dimensional defect enclosed by Xp, or that 
of a (d — p)-dimensional soliton intersecting Xp. 


Example 15.4 


The coset space G/H ~ U(1) ~ S! of superfluids has nontrivial first de Rham 
cohomology group, H'(G/H) ~ R. Pulling its generator back to space and 
integrating over a closed loop X; defines the winding number of a (d — 2)- 
dimensional defect: the vortex. The coset space G/H ~ SU(2)/U(1) ~ S° of 
ferro- and antiferromagnets has nontrivial second de Rham cohomology group, 
H?(G/H) ~ R. Pulling its generator back to R? and integrating over the entire 
plane, ©. = R?, defines the charge of a 0-dimensional soliton: the baby skyrmion. 
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All the conservation laws of the type (15.3) discussed until now were emergent. 
Namely, their existence follows from the topology of the coset space G/H, which 
itself parameterizes the breaking of the symmetry we started with. There are 
however also other, important examples of generalized conservation laws. For 
instance, the Maxwell equations in absence of matter take the form of a rank-two 
conservation law, 0, F"” = 0. In this case, the charge (15.4) obtained by integration 
over a closed spatial surface is nothing but the electric flux through that surface. 
The “defect” is a localized electric charge enclosed by the surface. Intriguingly, 
the spontaneous breakdown of the generalized symmetry of electrodynamics is the 
ultimate reason why the photon is massless. This brings us back to the beginning of 
the book: we are constantly surrounded by a sea of NG bosons, namely sound and 
light. A reader wondering whether the classification of NG bosons into type-A and 
type-B also applies to generalized symmetries will be glad to hear that this is indeed 
the case. See [33] for a discussion of the classification and counting of NG bosons 
of generalized symmetries, and [34] for a nontrivial example of a system where the 
photon coupled to a particular type of matter behaves as a type-B NG mode. 
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A 
| | 
Check for 
updates | 


I will close the book by highlighting some technical issues that I could not have 
addressed properly. I label them as “open questions,’ at the risk of revealing 
my own ignorance rather than a gap in the existing knowledge. Since the book 
revolves around developing a formalism for spontaneously broken symmetry, some 
of the items are purely mathematical. The first commandment of effective field 
theory (EFT) dictates that the effective Lagrangian must contain all operators that 
respect the symmetry of the system. Accordingly, an important ingredient of the 
EFT formalism is the classification of all possible Lagrangians consistent with the 
symmetry. This requirement lies behind most of the issues listed below. 


Universality of the Standard Nonlinear Realization The classification of group 
actions on a manifold in Chap. 7 relies heavily on the assumption that the 
isotropy subgroup H of the symmetry group G is compact. Any progress towards 
classification of group actions such that H is noncompact would be most welcome. 
This applies in particular to coset spaces G/H that are not reductive, for which 
many of the nice features of our standard nonlinear realization are lost. 


Global Existence of the Group Action The standard nonlinear realization of 
symmetry developed in Chap. 7 is restricted to a single local coordinate patch. 
Accordingly, the explicitly constructed action of the group G is limited to its 
elements near unity. Yet, the global existence of the group action on the whole 
manifold M, or coset space G/H, has been implicitly assumed throughout the book. 
A better understanding of possible topological obstructions to extending the group 
action from a local coordinate patch to the entire manifold would be desirable. 
This issue is particularly pressing for Lie groups consisting of several connected 
components. As far as I know, the first attempt to systematically deal with such cases 
was made in [1]. However, more work is needed to establish a general, practically 
useful formalism. 
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Group Action via Generalized Local Transformations Most of the book focuses 
on symmetries realized by point transformations on a target space M or its Cartesian 
product with the spacetime M. This allows one to use the mathematical language 
of group actions on a finite-dimensional manifold. However, we have also seen 
some physically relevant examples of generalized local symmetries, where the 
transformation of fields is allowed to depend on their derivatives. This suggests 
working directly in the infinite-dimensional space of fields as maps M > M. 
With the knowledge of the symmetry group and the symmetry-breaking pattern, 
one may then still apply the agnostic nonlinear realization, mentioned in Chaps. 12 
and 13. To what extent this exhausts all possible actions of the symmetry remains 
unclear. Yet, a setup that makes the implementation of generalized local symmetries 
systematic has long been in use in the theory of differential equations [2]. First 
attempts to apply this setup to EFT appeared very recently [3,4]. Hopefully, they 
will help to place the treatment of generalized local symmetries on the same footing 
as that of point symmetries. 


Interplay of Nonlinear Realization and Inverse Higgs Constraints As stressed 
in Chap. 13, the operational way of eliminating would-be Nambu—Goldstone (NG) 
fields that excite gapped modes using an inverse Higgs constraint (IHC) is optional. 
The EFT for solely genuine NG degrees of freedom, obtained by applying an IHC, 
may carry a realization of the symmetry by generalized local transformations. On 
the other hand, the EFT before the IHC is imposed is ambiguous in that its field 
content may depend on the choice of order parameter. Either way, the universality 
of the nonlinear realization of the symmetry may be compromised. Here, too, more 
work is needed to establish that the existing formalism(s) for spontaneously broken 
spacetime symmetry give(s) the most general effective Lagrangian. 


Nonlinear Realization of Translation Symmetry Possibly the largest gap in the 
narrative of the book is the treatment of spontaneously broken translation symmetry. 
The main challenge is to establish a unique local parameterization of given fields 
in terms of NG variables, one for each broken translation generator. Thus, the 
discussion in Chap. 13 is limited to order parameters spatially modulated just in one 
direction. A generalization of the formalism to ordered states of matter, modulated 
in several dimensions, is pending. The same obstacle hinders the application of the 
background gauge approach, which proved extremely efficient in case of internal 
symmetries (Chap. 8). Some concrete applications of this approach beyond the 
single example worked out in Chap. 13 can be found in [5]. However, a fully general 
background gauge formalism for construction of EFTs for spontaneously broken 
spacetime symmetry is not available as yet. 
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Most of the book relies on mathematical background familiar to any graduate stu- 
dent of theoretical physics, that is mostly advanced calculus and basic group theory. 
However, some topics are greatly illuminated by taking a more geometric viewpoint. 
For instance, differential calculus on manifolds and Riemannian geometry provide 
useful insight into the nature of coset spaces on which effective field theories for 
spontaneously broken symmetry are defined. This is discussed in Chap. 7. Likewise, 
the classification of effective theories for broken internal symmetries worked out 
in Chap. 8 is greatly simplified by using differential calculus on manifolds and 
elements of de Rham cohomology. Las but not least, Riemannian geometry provides 
a practically convenient language for the analysis of scattering of Nambu—Goldstone 
bosons in Chap. 10. 

The purpose of this appendix is to give the reader the background needed to 
follow such more advanced parts of the book. I try to keep the discussion elementary, 
focusing on qualitative understanding rather than on mathematical rigor. Wherever 
possible, I justify the need for new concepts and motivate their formal definition. 
This necessarily makes the exposition biased. My approach might be natural for 
someone familiar with quantum field theory who seeks to develop advanced calculus 
methods to deal with fields that live on a manifold. The appendix can however be 
studied independently without any reference to the main text of the book. A more 
complete introduction to differential geometry at a similar level as here can be found 
in [1]. A reader looking for a more thorough treatment of the subject at a level still 
accessible to a graduate student of theoretical physics is advised to consult [2—4]. 


A.1 Smooth Manifolds 


The basic concept of differential geometry is that of a manifold. This is an 
abstraction that arose historically from the study of non-Euclidean geometry and of 
the geometry of surfaces. Formally, a manifold M of dimension n is a set equipped 
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with the notion of closeness (topology) that can be locally mapped to R” in a one- 
to-one manner. More precisely, one assumes the existence of a collection of pairs, 
{(U;, pi)}, where U; is an open subset of M and g; : U; —> R” is a homeomorphism. 
(The latter is defined as a continuous map that is invertible and whose inverse is also 
continuous.) Borrowing terminology from cartography, each pair (U;, g;) is called 
a chart, and the collection of all charts is an atlas. It is required that each point 
x € M belongs to at least one of the sets U;. The point g;(x) € R” corresponds 
to an n-tuple of real numbers x“, a = 1,...,n, called the coordinates of x. It is 
common if a bit sloppy to identify the point x € M with its image g; (x) or even the 
coordinates x“, and I will often follow this convention. 

Mapping a point x € M to its coordinates is both a blessing and a curse. On 
the one hand, the coordinates allow us to import to the manifold mathematical 
structures we are familiar with from ordinary calculus. On the other hand, it may 
not be possible to cover the whole manifold with a single set U;. The coordinates 
are in general only defined locally in some neighborhood of x. We must therefore 
be able to switch from one set of coordinates to another. The first commandment 
of differential geometry is that whatever structure we build on the manifold, it must 
have an intrinsic meaning regardless of a particular choice of local coordinates. 


Example A.1 


The M = R” is itself an n-dimensional manifold. It can be covered with a single 
coordinate chart, (U, g) = (R”, id), corresponding to usual Cartesian coordinates. 
The coordinates can however also be chosen differently, in which case a single chart 
may not be sufficient to cover the whole R”. For example, let us consider R? and use 
the polar coordinates 0, 6 instead of the Cartesian coordinates x, y. The map 


(x,y) > (0,0) = (ve + y?, sgn y arccos a) (A.1) 
x4 +y 


is continuous and invertible on U = R? \{(x, 0) | x < 0}, making U homeomorphic 
to the domain (0, +00) x (—7, +7) C R?. To cover the whole plane M = R?, we 
would need to add at least one more chart. 


The first thing we can do with the local coordinates is to introduce a differential 
structure on M. Let us choose a chart (U; , gi). We then define a smooth real function 
on U; as a map f : U; — R such that f o gy, is an infinitely differentiable 
function on g;(U;) C R”. We would of course like the concept of smoothness to 
be independent of the choice of chart. This is ensured by requiring that for any 
two overlapping open sets U;, U;, the map g; o 9; that switches between the two 
coordinate systems is itself infinitely differentiable. A function defined on the whole 
manifold, f : M — R, is now called smooth if it is smooth when restricted to any 
single coordinate chart. 
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Fig. A.1 Stereographic 
projection of the n-sphere S” 
to R”, here for n = 1. First, 
S” is naturally embedded in 
R”+! as a unit sphere 
centered at the origin. A point 
P on S” is then projected to 
the point Q in R” using a 
straight line starting at the 
north pole N of the sphere. 
An alternative projection 
from the south pole S would 
map P to Q' 


One way to define local coordinates on the n-sphere S” is through stereographic 
projection, see Fig. A.1. We imagine that $” is realized (embedded) in R”+! as a 
unit sphere centered at the origin. The embedding is defined implicitly by (x!)* + 
see +(x")? +z? = 1, where (x!,..., x”, z) are Cartesian coordinates in R’+!. A 
given point P € S$” is then projected from the north pole N to the hyperplane R” 
defined by z = 0. The image of P is the point 


X X 


1 n 
2 1 ny o 
osoh D= (ee) (A.2) 


The set (y!,..., y”) constitutes the desired local coordinates on $”. These coor- 
dinates are well-defined on the whole sphere except for the north pole N itself. 
To cover the entire sphere, we need another coordinate patch. This can be defined 
through a similar projection from the south pole S. The latter maps P to 


1 n 

/ at m x x 
= aas = peeey — ]. A.3 
Q =y y”) (= =) (A.3) 


Noting that 


|x| 1l+z ; |x| l-z 
ae n] l E = À AA 
a ra | eae Ol Gag T (A4) 
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we find that the two sets of local coordinates are related by inversion, y/@ = y“/|y|?. 
This ensures smooth transition between the two coordinate systems wherever they 
are both well-defined, that is on S” with both poles removed. 


A.2 Linear Structures on Manifolds 


We have now learned what a smooth function on a manifold is. Since our primary 
goal is to develop calculus on manifolds, we would eventually like to differentiate 
and integrate such functions. In order to be able to do so independently of the choice 
of local coordinates, we will however need additional structure on the manifold. 


A.2.1 Tangent Vectors and Vector Fields 


The first step towards setting up calculus on manifolds is to define a derivative of a 
function. In R” with its Cartesian coordinates, this naturally leads to the notion of a 
partial derivative. On a general manifold M, there is however no such a preferred set 
of coordinates. The way out is to generalize the concept of a directional derivative 
in R”. Thus, for a fixed point x € U; C M and a set of coefficients vl, ..., v”, we 
define a linear operator v acting on smooth functions f on U; through 


a _ 4, fog’ 
so =v 


ox4 


(A.5) 


x 


Such a linear operator is usually called a tangent vector to M at x. Accordingly, the 
set of all tangent vectors at x is called the tangent space of M at x and denoted as 
TM. Let me stress that the adjective “tangent” here is just a convention. There is 
no way to give it a literal geometric meaning unless we imagine M as a hypersurface 
embedded in a higher-dimensional Euclidean space. 

The definition (A.5) relies on the components v“ in a fixed coordinate basis of the 
tangent space, 0g = 0/dx“. However, the concept of a tangent vector has a meaning 
independent of a specific choice of local coordinates. We just have to remember 
to recalculate the components of the vector if the local coordinates change, as the 
following example illustrates. 


Example A.3 


Suppose that we trade the Cartesian coordinates x, y in R? for the polar coordinates 
o, 0 through the usual prescription (x, y) = (@ cos 0, ọ sin 0). The coordinate basis 
of the tangent space is then converted by using the chain rule for partial derivatives, 


dg = (cos O)dx + (sin@)dy , dg = —(@sin@)d, + (@cosO)dy . (A.6) 
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In order for a vector v to be well-defined independently of the choice of local 
coordinates, its components (v*, v) and (v2, v) in the two coordinate systems 
must be related by 


; j v? 
v? = v“ cos + v” sing , v? =—~ sind + —cosd. (A.7) 
Q Q 


These results agree with what elementary planar geometry would tell us if we treated 
dg and (1/@)d9 as unit vectors in the radial and azimuthal direction, respectively. 


In general, if we change the local coordinates x“ smoothly to x“(x), the 
coordinate bases and vector components in the new and old coordinate systems are 
related by 


z ôx? a 3, 
da = axa Ob y = Jb” : (A.8) 


The first of these is just the chain rule, whereas the latter follows from the 
requirement that the vector v = v“d, = lð, is independent of the choice of 
coordinates. Note that 0x“ /ðx” is just a shorthand notation. If x“ and x“ correspond 
to coordinate maps ¢;, 9j, this notation represents the partial derivatives of gj og; ', 
evaluated at y;(x) € R”. In the following, I will occasionally use such intuitive 
notation without further warnings. 

For most applications in physics, a vector field is of greater interest than a 
(tangent) vector defined at a single point x € M. Just like for real functions on 
M, it is natural to require that a vector field varies smoothly as a function of x. 
This is most easily ensured by demanding that, within a local coordinate patch, the 
components vf are themselves smooth functions of x. It is also possible to define a 
vector field without referring to specific coordinates. One thus treats it as a collection 
of vectors v(x) € TyM such that v(x)[f] is a smooth function of x for any smooth 
“test function” f on M. 

With the notion of a vector field at hand, let us briefly return to the choice of 
basis of the tangent space. With local coordinates x“ defined on an open set Uj, 
we get a set of coordinate basis fields 0, that are likewise well-defined only on Uj. 
It is however often advantageous to have a basis that is not limited to a particular 
coordinate patch, or not related to a specific set of coordinates at all. One thus defines 
a (local) frame as a set of vector fields e4, A = 1, ...,, defined on some open set 
U C M, such that the set of vectors {e4(x)}",_, is a basis of 7, M at any x € U.! 
Any vector field can then be expressed in terms of its components with respect to the 


' Within this appendix, I will carefully distinguish coordinate indices a, b,... from frame indices 
A, B,....In practice (including the main text of this book), the two notations are often blended. 
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frame, v(x) = v^ (x)e 4 (x). I will sometimes use a local frame instead of coordinate 
basis fields; the latter can always be viewed as a special case of the former. 


Restricting the definition of a local frame to an open subset of M is not 
as innocuous as it might appear. The existence of a frame well-defined on 
the whole of M depends on the global topology of M. Even on a manifold 
as simple as the 2-sphere S7, there is no smooth vector field that would be 
nonzero everywhere! This is the “hairy ball theorem” of algebraic topology; 
see Sect. 13.7 of [1] for an elementary discussion. 

A second remark of caution is that a local frame is not merely a coordinate 
basis in disguise. By the definition of a basis, one can always relate a frame 
ea to any coordinate basis 0g by ea = 49a, where Ci is a matrix of 
coefficients with nonzero determinant. In general, there is however no set of 
local coordinates yo, A = 1,...,n such that e4 = a/ay4. As I will show 
later, this is only possible if the condition e 4[c%] = eg[c4] is satisfied for all 
A, B anda. 


A.2.2 Tensors and Tensor Fields 


Once we know how to construct tangent vectors and vector fields on a manifold, 
we can quickly build tensors and tensor fields of an arbitrary rank. While the 
motivation for doing so may not be as immediate as for tangent vectors, several 
types of tensors are of central importance to differential geometry. To start with, the 
cotangent space of M at x, 7; M, is defined as the dual of T, M. Its elements, called 
cotangent vectors or covectors, are linear maps from 7, M to R. The counterpart of 
a coordinate basis dg on TyM is the dual coordinate basis on 7; M, usually denoted 
as dx“. Hence dx“ (3p) = ô; . The action of any covector on any vector then follows 
from linearity and can be expressed component-wise, 


w = Wdx* , v = vfða > wV) = Mav’ , wEeETM, vET;M. 
(A.9) 


The choice of notation for the dual coordinate basis is not accidental. Any smooth 
function f defined in a neighborhood of x gives rise to a covector df through 


af 


df(v) = v1 fl= vo, 
XxX 


veTxyM. (A.10) 


I have simplified the notation by treating f as a function of the local coordinates 
x; cf. (A.5). By the definition of the dual basis, we have v? = dx“(v). Hence 
df = (0f/dx")dx", generalizing the differential of a function known from ordinary 
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calculus. It follows that upon switching the local coordinates from x“ to x“ (x), the 
dual coordinate basis and components of covectors transform to 


ax4 ox? 
dz = Z axt, = yg = a. (A.11) 
ax 


Higher-rank tensors can now be built using tensor products of TyM and 77M. 
Generally, a tensor of type (q, p) atx € Mis an element of (Ty M)81 @ (T<M)®?. 
A class of tensors of special importance to both differential and integral calculus on 
manifolds is obtained by taking the p-fold antisymmetric tensor product of 7; M, 
where p is a positive integer. The elements of this space, Q M, are called p-forms 
and define antisymmetric multilinear functions on TyM. Note that Q 1M coincides 
with TM; covectors are indeed often referred to as 1-forms. An antisymmetric 


tensor product of p covectors @1,..., @p is represented compactly by the notation 
OL A+++ A Op = SENT (1) B+ @ Orp) » (A.12) 

T 
where the sum is over all permutations m of the indices 1,..., p. Using this 


notation, we can write a general element w € Qe M in the dual coordinate basis 
as 


1 


w = paa A+++ Ad? , Way--ay = O(a,» - - -> ay) - (A.13) 


The expression for the components of the p-form is a direct generalization of the 
relation wa = æ (ða) for 1-forms. 

Equation (A.12) is more than just a useful shorthand notation. It can be extended 
to an operation on forms of arbitrary degree that is linear and associative: the 
exterior product. With the dual coordinate basis expansion (A.13) at hand, linearity 
and associativity require that any two forms œ € Q M anda € 24M are mapped 
to the (p + q)-form 


1 
wno = Digeon Tbb d" Ae adx adx! A+. adx. (A14) 


The same result can be alternatively expressed in a coordinate-independent manner 


by the action on a set of test vectors v1, ..., Vp+q from T; yM, 
(wA), ..., Uptq) 
1 (A.15) 
> SENT O(Vq(1), - +5 Va(p))O Wap)» +++ Ux(p+q)) > 


pig! & 
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where x now runs over permutations of 1,..., p + q. The antisymmetry of forms 
as multilinear maps implies that the exterior product is graded-anticommutative, 


wno = (1Ps no, we RM, co EQim. (A.16) 


The direct sum Bp =0 QEM equipped with the exterior product possesses the 
structure of an associative algebra: the exterior (Grassmann) algebra. Here RM 
is identified with R and the direct sum terminates at p = n; by the antisymmetry 
property there are no p-forms with p > n on an n-dimensional manifold. 

Another operation on forms that will prove useful is the interior product. This 


assigns to any vector v € T, Ma linear map u : artim —> QM defined by 
Wo(V1,...,Up) = @(V, V1, ..., Vp), Vi, ..., Vp ETM (A.17) 


for any œ € QP +I M. While this definition is coordinate-independent, the interior 
product takes a particularly simple form in the component notation where it amounts 
to contracting the vector v with the (p + 1)-form o, (ly®)ay---ap = OO pasos 
Thanks to the antisymmetry of forms, the interior product is anticommutative that 
iS tu Oly = —ly Oly for any two tangent vectors u, v. Moreover, ty with any fixed v 
is an antiderivation of the Grassmann algebra: for any p-form w and q-form o one 
has 


wlw Aa) = (yw) Ao + (-1)?@A (yo) . (A.18) 


In fact, the properties of the interior product are already completely fixed by 
requiring that it is an antiderivation that reduces to tyw = w(v) on 1-forms w. 

So far I have limited the discussion in this subsection to tensors defined at a 
single point x € M. However, we can build tensor fields of arbitrary type (q, p) 
in complete analogy with the construction of vector fields out of tangent vectors at 
different points. 


Example A.4 


A metric g is a symmetric tensor field of type (0, 2) such that g(x) € TM O T¥M 
is a positive-definite bilinear form for any x € M. For example, R” as a Euclidean 
space is equipped with the metric g = d,,dx“ @ dx” where x“ are the Cartesian 
coordinates. The metric can of course be converted to any other coordinates by 
using (A.11). For instance, in R? we may want to switch to the spherical coordinates 
(r, 0, o), defined by (x, y, z) = (r sin@ cos g, r sind sing, r cos 0). The metric then 
assumes the form 


g = dr Q dr +r7d0 @ dO + r° sin? 6 dy @dg. (A.19) 


An interested reader will find more about the metric tensor in Sect. A.6. 
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Promoting a p-form to a tensor field leads to the concept of differential p-form.” 
Just like for vector fields, we have the freedom to choose other bases for differential 
p-forms than those induced by local coordinates on M. Thus, a (local) coframe is a 
set of differential 1-forms e*4 defined on some open set U € M such that the set of 
1-forms {e*4 (x)}"4_1 Spans a basis of a. M at any x € U. Any frame e4 induces 
a unique dual coframe e*“ (and vice versa) by requiring that e*4(eg) = 54. All 
the properties of p-forms reviewed above can be immediately promoted to frame 
components of differential p-forms by the replacements 0, —> e4, dx“ — e*4 and 
Daya) © DAA, wherever necessary. The same remark applies to tensor fields of 
an arbitrary type (q, p). 


Let us use the spherical coordinates (0, g) to parameterize the 2-sphere $°. 
Following Example A.4, this allows us to define a metric on S? by restricting (A.19) 
to fixed radial variable, r = 1, 


g = dô Q d0 + sin? Ody 8 dọ . (A.20) 


The vector fields 


1 
e9 = d , ep = — A.21 
e 2 gino? ee 
then constitute an orthonormal frame, g(e4,ep) = 654g, wherever well-defined. 
The corresponding dual coframe is e*? = d6 and e*? = sin@ dg. Out of these two 
1-forms, we can naturally build a 2-form through 


wo =e a e"? = sind dd ^ dọ . (A.22) 


One might be tempted to think of the frame (eg, ey) as defining an area element on 
the sphere that can be subsequently used for integration. However, as I will explain 
in Sect. A.7, integration on manifolds is done using differential forms. For the 2- 
sphere, it is exactly the differential 2-form (A.22) that does the job of defining the 
proper area element. 


A.3 Maps Between and on Manifolds 


In Sect. A.1 we saw how to define a smooth function on a manifold M as a map 
from M to R. This concept of smoothness is easily generalized to maps between two 


? For the sake of brevity, I will mostly drop the adjective “differential” where it is clear from the 
context that I speak of a tensor field rather than a tensor defined at a single point. 
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Fig. A.2 A smooth map f 
between manifolds M and N, 
defined locally in the ff ep eee 
neighborhood of the point 

x € M. Composing a test 

function g, defined locally 

around the image f(x) € N, 

with f allows one to lift the 

map f to the push-forward 

Íe : TxM => TFN 


POE 
ve) 


manifolds M and N. Fora map f : M — N that is continuous, it is always possible 
to choose a chart (U, g) on M and a chart (V, x) on N such that f(U) C V. The 
map is then called smooth if x o f o g7! is infinitely differentiable. A smooth map 
f :M-— N that is a homeomorphism is called a diffeomorphism. 


A.3.1  Push-Forward and Pull-Back 


Smooth maps between manifolds naturally induce linear maps between the corre- 
sponding (co)tangent spaces. Since tangent vectors are differential operators acting 
on locally defined test functions, we first have to inspect how the map f : M > N 
affects the latter. It is clear from Fig. A.2 that by “connecting the arrows,” any test 
function g : N — R can be converted into the function g o f : M —> R. This is 
enough to connect tangent vectors on the two manifolds. Formally, one starts with a 
tangent vector v € TyM and maps it to a tangent vector fv € T ¢(.)N such that 


fx vlg] = vigo f] (A.23) 


for any test function g defined in a neighborhood of f(x) on N. The image fv is 
referred to as the push-forward of v by f. Suppose that we have a set of coordinates 
x“ in a neighborhood of x on M and a set of coordinates y’ in a neighborhood of 
f(x) on N. It then follows from the chain rule that the coordinate basis vectors at x 
are mapped to 


where f? is a shorthand notation for the b-th coordinate of f (x). This looks almost 
like a change of variables. It is therefore worth stressing that the manifolds M and 
N need not have the same dimension, hence the map f need not be invertible. 
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Example A.6 


Consider again the 2-sphere with its spherical coordinates (0, gy) and define the map 
f : S? — R? as the natural embedding that takes the point (0, g) € S? to (x, y, z) = 
(sin@ cos gy, sin@ sing, cos 0) € R?. Then (A.24) gives us 


[x09 = (cos 0 cos ~)d, + (cos @ sing)dy — (sin 0)ð; , 
f (A.25) 
fxg = — (sin 0 sin p)dy + (sin 0 cos ~)dy . 


This “realizes” tangent vectors to the sphere as vectors in R*. The same strategy can 
be used to give a literal geometric interpretation to tangent vectors on any smooth 
manifold by embedding the latter in a higher-dimensional Euclidean space. 


Once we know how to relate tangent vectors to M and N, we can do the same 
for covectors by using the dual nature of the cotangent space. The “connecting the 
arrows” strategy hints that the map between the cotangent spaces acts in the opposite 
direction than f. Thus, we assign to w € T Na covector f*w € T*M so that 


f*ow)=o(fv) forany vET,M. (A.26) 


For obvious reasons, the image f*w is referred to as the pull-back of œw by f. The 
chain rule tells us that covectors from the dual coordinate basis are mapped to 


af? 


f*dył = gaw : (A.27) 
Xx 


which again looks like a mere change of variables but is really much more than that. 


By following the same line of reasoning, one can define the push-forward f, 
for any tensor of type (q, 0) and the pull-back f* for any tensor of type (0, p). 
In general, it is however not possible to define either of the two for tensors of 
the mixed type (q, p). Ina similar vein, it is generally not possible to promote 
the point-wise map f, : TyM — TQ WN to a push-forward of vector (or 
tensor) fields. A problem arises when two points x1, x2 € M have the same 
image under f. Our prescription (A.23) would then naively try to assign to a 
single point f (x1) = f (x2) two different tangent vectors. No such a problem 
arises for the pull-back, which can be extended from tensors of type (0, p) to 
tensor fields for any smooth map f. 

Both of the above problems disappear when f is a diffeomorphism. 
In that case, we have a locally well-defined one-to-one mapping between 


(continued) 
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M and N. The push-forward f, is an isomorphism between the respec- 
tive tangent spaces, and the pull-back f* is an isomorphism between 
the respective cotangent spaces. By combining fs :TxM >T ¢ayN with 
Gy :TEM > Tro, we can map any tensor (field) on M to a tensor 
(field) on N. 


Example A.7 


In Example A.4, I introduced the Euclidean metric on R”, g = 5g,dx% @ dx?, where 
x are the Cartesian coordinates. This metric can be pulled back to any manifold 
M that can be embedded as a hypersurface in R” by a smooth map f. The induced 
metric on M in chosen local coordinates y on M follows from (A.27), 


a b 
fig = Sap Lay’ @ ay! (A.28) 
dy® dy 

The justifies the previously used expression (A.20) for the metric on the 2-sphere. 
In the same manner, one can pull back any differential form defined on R” to the 
embedded manifold M. For instance, the area 2-form (A.22) on the 2-sphere has an 
elegant representation in terms of the natural embedding of S$? in R3. It is left to the 

reader as an exercise to check that (A.22) is recovered by pulling back the 2-form 


1 
5r @ar x dpr)dx® A dx? (A.29) 


in R?, where r = (x, y, z) and the dot and cross denote respectively the conventional 
dot and cross products of 3-vectors. It is common to abuse the notation and represent 
the area 2-form on S? by simply replacing the 3-vector r above with a unit 3-vector 
n that defines the embedding of $? in R*. One can then write the area 2-form on S? 
simply as (1/2)n - (dn x dn). 


A.3.2 Flow of Vector Fields 


In Sect. A.2.1, I emphasized that the concept of a tangent vector does not have 
a literal geometric interpretation unless we embed the manifold M in a higher- 
dimensional Euclidean space. It is however possible to visualize a vector field v 
through the flow it induces on M. Imagine that the manifold is filled with a liquid, 
and think of v(x) as the velocity of the liquid at x € M. We can then extract 
information about the vector field from the trajectories of individual particles of the 
liquid. 
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Let us now formalize this concept. We need a real parameter t € R to measure 
the “time” for the liquid motion. Next, we think of a single particle trajectory as a 
curve y : R + M satisfying the first-order differential equation 


afo) 
a Fy — 


for any test function f : M — R. This is just a coordinate-independent version of 
the relation between the particle’s trajectory and its velocity, dy“/dt = v“(y(t)). 
By choosing the initial condition as y(0) = x and varying x € M, we get a family 
of trajectories that define a map v, : M — M. In terms of the liquid analogy, 
Pv t(x) is the position at time ¢ of a particle that was at x at time t = 0. For smooth 
vector fields v, the flow equation (A.30) is guaranteed to have a (locally) unique 
solution. Hence the map ¢y,+ with fixed v and ¢ is a diffeomorphism on M. We can 
thus use it to push forward tensors from one point of M to another. This is called 
Lie transport. 


Consider the vector field v = —yd, + xdy in R2. The flow equation (A.30) is easily 
solved and gives 


x(t) = xo cost — yosint , y(t) = xo sint + yocost , (A.31) 


where Po = (xo, yo) is the starting point of the flow, see Fig. A.3. From the 
definition (A.23) of push-forward, or directly from (A.24), we then see that the 


Fig. A.3 Illustration of Lie 
transport of tangent vectors in 
R*. The Lie transport is 
generated by the vector field 
v = —yd, + xdy, one of 
whose flow lines is indicated 
by the dashed circle. As the 
point Pp is transported to P, 
the attached basis of tangent 
space at Po (displayed as 
solid arrows) is pushed 
forward to a basis of tangent 
space at P 
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coordinate basis (3x, dy) at Po is transported to 
Pv,txðx = (cos t)ðx + (sin t)ðy , Pv,txðy = —(sint)d, + (cos t)ðy (A.32) 


at P = (x(t), y(t)). The flow generated by v corresponds to a rotation of R? around 
the origin by angle t. Indeed, switching to polar coordinates (0, 0) through (x, y) = 
(o cos 0, ọ sin 0) shows that v = dg. Upon changing the basis of the tangent space, 
(A.32) is then recognized as a trivial map between the coordinate bases at Po and P, 
that is v,txðol Po = olp and dy,1%09| Pp) = 99|p. This is a special case of a general 
rule that when v is one of the coordinate basis fields, all the coordinate basis vectors 
at x are transported to the corresponding coordinate basis vectors at dy +(x). 

As another example, we may choose v = x0, + ydy. In this case, we find 


x(t) = xe , y(t) = yor’ , (A.33) 
and accordingly 
dv, tx Ox = el Ox ; hy, 1% Oy = e dy : (A.34) 


This is another illustration of Lie transport generated by a coordinate basis field. 
We just have to notice that in polar coordinates, v = Q@0g = ogg. Choosing local 
coordinates as (log ọ, 0) reproduces (A.34) as a trivial transport of (logo, 0g) at Po 
to (Ologg, 0g) at P. 


Lie transport plays a critical role for the implementation of symmetries on 
manifolds. Indeed, the map v, for fixed v and varying t constitutes a one- 
parameter group of transformations on M. The group structure amounts to the 
obvious properties 


Out © Pos = tees «COS ye «= Oy Sees (A.35) 


We can think of this as a “representation” of R by an additive group of trans- 
formations on M. It is straightforward to generalize this concept to other (finite 
or Lie) groups. Formally, the action of a group G on M is defined as a set of 
diffeomorphisms T, on M that respect the group structure of G? 


Toi o Tg, = Tig » T, = id, (Ty)! =T, 


gl > 8, 81,82EG. 


(A.36) 


For any Lie group G, we can restrict to a specific one-parameter subgroup. This is 
mapped by (A.30) to a vector field that represents the corresponding generator of G 


3 The concept of group action is introduced in a more pedestrian manner and used in Chap. 7. 
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on M. Combined with (A.35), we deduce that there is a one-to-one correspondence 
between vector fields and one-parameter groups of transformations on M. 


A.3.3 Lie Derivative 


The practical utility of Lie transport is limited due to the necessity to solve the 
set of coupled first-order differential equations (A.30) for the flow. It is usually 
more convenient to consider the variation of tensor fields on M under infinitesimal 
transformations generated by v. This variation is formally encoded in the Lie 
derivative, defined for a given tensor field T as 


— do,teT) 
dt 


= lim T- Pv,exT 
t=0 ~ €30 € l 


LyT = (A.37) 


The overall minus sign is a convention arising from the fact that (dy,¢xT)(x) 
corresponds to the value of T at @y,-<(x), pushed forward to x by the map ġv,e. 

Let us see how this works for vector fields. For the moment, it is practical to 
resort, even if only implicitly, to local coordinates. The vector field v then becomes 
a linear differential operator. The flow equation (A.30) in turn has a formal solution 
for @y,r in terms of the (path-ordered) exponential of v, given implicitly by 


FOr) =e" f(x) . (A.38) 


The definition (A.23) of push-forward now tells us that for any test function f and 
any vector field u, v Ulf] = ul f o dy] = ule” f]. If we start with u(x), this 
will give us the value of Øy su at Øy s(x). To find the value of py su at x, we 
invert (A.38) to f(x) = e™™ f (v t (x)). This leads to 


vt vt 


Qv, xu =e "Ue hence Lyu = [v, u], (A.39) 


where I introduced the commutator of v and u as differential operators, also 
called the Lie bracket. Such a commutator is only meaningful when expressed in 
local coordinates. However, the Lie derivative as defined by (A.37) is coordinate- 
independent. Hence also the Lie bracket is well-defined independently of the choice 
of coordinates. 

Let us pause to ponder on the significance of the Lie bracket. First, as a formal 
commutator, this defines a Lie algebra structure on the space of vector fields on M. 
The Jacobi identity for this Lie algebra is equivalent to the Leibniz (product) rule 
for the Lie derivative, 


Lulv, w] = [Luv, w] + [v, Luw] . (A.40) 


What if we take a special set of vector fields that are the generators of the action of 
a Lie group G on M? We find, not surprisingly, that the Lie brackets of these fields 
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reproduce the Lie algebra g of G. Instead of a formal proof of this statement, let me 
show a simple illustrative example. 


In Example A.8, I introduced a vector field that generates rotations in R? around 
the origin. This is easily promoted to R? where we have three independent rotations 
around the Cartesian coordinate axes, generated by 


Vy = yd, — Zðy , Vy = 20, — XO; , vz = Xy — yd, . (A.41) 
The Lie brackets of these, 
[vx, Vy] = —v;z , [vy, vz] = —vx , [vz, Vx] =—vy, (A.42) 


reproduce the Lie algebra of infinitesimal rotations in R*, s0(3). Further general- 
ization to R” is straightforward and amounts to considering the set of vector fields 
Vab = Xap — Xpða With a, b = 1,...,n. 


Second, having at hand a coordinate basis, it is easy to give a general expression 
for the Lie derivative of a vector field in terms of its components, 


(Lyu)? = v? du? — u? ðpv! = v[ut] — ulv") . (A.43) 


The latter expression clarifies the comment on the relation between coordinate 
basis fields and (local) frames that I made at the end of Sect. A.2.1. Namely, the 
set of coordinate basis fields {34}; _; always has vanishing Lie brackets. Thus the 
necessary condition for the existence of local coordinates y^ such that a frame 
{ea} 4-4 can be represented by e4 = a/ay4 is [e4, eg] = O for all A, B. It turns 
out that this condition is also sufficient; see Chap. 9 of [5] for a proof. 

Let us now return to the general discussion of the Lie derivative. This can be 
evaluated on other tensor fields by making use of what we already know for vector 
fields. For example, let us take a 1-form œw and a test vector field u. Using the 
definition (A.26) of pull-back together with the inverse of (A.38), we get 


(bv,tx@) (bv, 1xU) = (Py, 10) (Pv 124) = e “w(u) : (A.44) 


Combining the general definition (A.37) of the Lie derivative with the Leibniz rule 
then leads immediately to 


(Lyw)(u) = v[o(u)] — w([v, u]) . (A.45) 
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In fact, the same reasoning can be applied without change to any differential p-form. 


For a set of test vector fields u1, .. . , Up, we find 
(Lro) (ui, ...,Up) = vl@(uy, ..., Up) ] 
p (A.46) 


= Solu, ..., tii, [v, Uj],Ujt1,-.-,Up) . 


For tensors of the general type (q, p), it is more straightforward to work with 
components with respect to a coordinate basis. One can think of a tensor T as a 
multilinear map acting on a set of test vector fields and 1-forms. Combining the 
Leibniz rule with (A.39) and (A.45) then leads, after some manipulation, to 


(Login = TEE + TR Om! + THe Agu! bo 


mn- mn--- In-- ml. n 
= TE aut — Te a oo. 
It is common to extend this identity to the special case of (q, p) = (0,0), 


corresponding to functions on M. In this case, the Lie derivative coincides with 
the previously defined directional derivative, Ly f = v[ f]. 

Before closing the section, let me mention a useful identity, connecting the Lie 
derivative of differential forms with the interior product, defined by (A.17). For two 
vector fields u, v, one finds that 


Lu © tw — ty © Lu Sea (A.48) 


This is another variation on the Leibniz rule, as one can easily see be rewriting it as 
Luly) = tu, vj + w (Luw) for an arbitrary test p-form w. The proof proceeds by 
combining the definition (A.17) of interior product with (A.46). 


A.4 Exterior Derivative 


Until now, I have tried to treat on more or less equal footing tensor fields of arbitrary 
type (q, p). However, as Sect. A.2.2 already hinted, the (differential) p-forms have 
a special standing among all the tensors. In this section, I will introduce a new 
differential structure on the Grassmann algebra of differential forms. The central 
concept thereof is the exterior derivative, which is a derivative-like operation that 
converts a p-form into a (p + 1)-form. This is a tool of great importance for much 
of both differential and integral calculus on manifolds. There are several ways of 
introducing the exterior derivative with different balance of abstraction, elegance 
and technical simplicity. While there does not seem to be a unique, intuitively simple 
definition, I will at least try to make a brief comparison of the different approaches. 
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Consider a differential p-form w and a set of p + | test vector fields, v;. We can 
define the exterior derivative of w as the unique (p + 1)-form dw such that 


p+ 


do(v1,..., vp) = X Divile, ..., 6, ..., vp) 
i=1 


+ SED Hollo, vj], Vl, ...; di, eee dj, sey Vp+1) 
i<j 


(A.49) 


for any choice of test fields, where a hat denotes an argument that is to be omitted. 
This definition is manifestly independent of the choice of local coordinates, but not 
exactly intuitive. Let us therefore try to unfold its content in some simple cases. 
First, for 0-forms, that is functions on M, (A.49) gives simply d f (v) = v[ f]. This 
agrees with our previous definition (A.10) of the differential of a function. Second, 
for a 1-form @ and a 2-form Q we get respectively 


dw(u, v) = u[w(v)] — vlw (u)] — œ (lu, v]) , 
dQ (u, v, w) = u[Q(v, w)] + v[Q(w, u)] + w[2 (u, v)] (A.50) 
— Q ([u, v], w) — Q([v, w], u) — 2 ([w, u], v) , 


for arbitrary test vector fields u, v, w. It is becoming clear that for any p-form w, 
dw only contains two types of contributions; the summation in (A.49) is needed to 
ensure that dw maintains full antisymmetry as a multilinear map on vector fields. 
It is possible to show (see Chap. 36 of [6]) that (A.49) generalizes the concept of 
a directional derivative from functions to p-forms. This accounts for the first line 
of (A.49). The second line thereof makes dw local so that dw(v1,..., v prev only 
depends on the values of the test vector fields at the point x, not on their derivatives. 

By picking the test vector fields v; from a coordinate basis and using the 
component expansion of p-forms (A.13), we find 


dw = la dx? A dx” dx ASI 
=n bWay---ap) x Ad A+++ A dx? . (A.51) 
Dp: 


This is in practice the easiest way to calculate the exterior derivative of a differential 
form. It makes manifest the fundamental property of the exterior derivative that 


dow = d(dw) = 0 (A.52) 


for any p-form w. Moreover, it can be used as an alternative definition to (A.49) that 
is technically simple and quite intuitive. The downside of this approach is that it is 
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not obvious that an object defined by (A.51) actually exists, independently of the 
choice of local coordinates. To prove this requires some extra effort. 


In case w is a O-form, that is a function on M, the components of dw are (dw)g = 
daw. This generalizes the gradient of functions in R” known from ordinary vector 
calculus. If, on the other hand, w is a 1-form, it follows at once from (A.51) that the 
components of dw are 


(d@)ab = awb — Opa « (A.53) 


This appears to generalize the concept of a curl of a vector field in R*. The problem 
with this interpretation is that neither w nor dm is a vector field. We will see in 
Sect. A.6.5 how to circumvent this issue. 

Antisymmetric derivatives such as (A.53) are common in physics. If A = Agdx 
represents the potential of an electromagnetic field, then (A.53) are the components 
of the corresponding field-strength tensor, F = dA. In this case, the fundamental 
property (A.52) of exterior derivative encodes the familiar Bianchi identity for the 
electromagnetic field, 


Oc Fab + ða Fhe + Op Fea = 0. (A.54) 


The exterior derivative interconnects nicely with other operations on differential 
forms that we have met. It is for example easy to prove using the coordinate 
definition (A.51) that the exterior derivative is an antiderivation of the Grassmann 
algebra of differential forms. Thus, for any p-form w and any q-form o, 


d(w Ao) = dw Ao + (—1)w A^ do . (A.55) 


The coordinate-free definition (A.49), on the other hand, makes it easier to show that 
the exterior derivative “commutes” with pull-back and hence with Lie derivative. 
This means that for any smooth map f : M — N, differential form w on N and 
vector field v on N we have 


d(f*w) = f* (dow), d(Lyw) = Ly (dw) . (A.56) 


The interior product, Lie derivative and exterior derivative constitute three 
derivative-like operations on differential forms. The first decreases the form degree 
by one, the second leaves it unchanged and the last increases it by one. There is a 
simple identity that brings them all together which, perhaps for its intrinsic beauty, 
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is usually referred to as “Cartan’s magic formula,” 
Ly=wodt+doy or Low = u(dw) + dvo), (A.57) 


for any p-form w and vector field v. The proof is a straightforward combination of 
the coordinate-free definition (A.49), the Lie derivative of differential forms (A.46) 
and the definition (A.17) of interior product. Note how the Cartan magic formula 
together with d o d = 0 immediately implies the second of the identities in (A.56). 


Here is a somewhat more advanced example, closely related to the subject of this 
book. Take a Lie group G and introduce the following object, called the Maurer- 
Cartan (MC) form, 


w = —ig 'dg=o'Q,, geG, (A.58) 


where Q 4 are the generators of the Lie algebra g of G. In order to give a meaning 
to this definition, one should think of g € G in terms of a set of functions on 
G, returning the matrix elements of g in some faithful representation of G. The 
identities (7.30) and (7.31) of the main text of this book guarantee that w takes 
values in the Lie algebra g. Hence the components w“ constitute a set of well- 
defined 1-forms on G. 

Given that the dimension of G as a manifold equals that of its Lie algebra, the 
1-forms w^ furnish a globally well-defined coframe on G. The exterior derivative 
da“ should therefore be a linear combination of w? Aw. Indeed, a short calculation 
shows that the expression of dw in terms of œ? Aw reflects the algebraic structure 
of G as a Lie group, 


dw = —idg' A dg =ig'dg A g-'dg = -iw Ao, 
(A.59) 


1 
do^ = 5 face” A of 7 


where f ae are the structure constants of G. This is known as the MC equation. 


Before closing the quick overview of exterior calculus, let me mention in passing 
another way to define the exterior derivative. This might please a reader who prefers 
a more axiomatic approach. Namely, the exterior derivative turns out to be a unique 
antiderivation of the Grassmann algebra that reduces to the ordinary differential on 
0-forms and satisfies the fundamental property (A.52). For a reader that might prefer 
a more geometric definition of exterior derivative, there is yet another possibility. 
This is intimately related to integration on manifolds and I will return to it in 
Sect. A.7. 
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A.5 Affine Connection 


In order to develop a full-fledged differential calculus on manifolds, we have to be 
able to differentiate not only functions, but also vector and tensor fields. In spite of 
the terminology, none of the two previously introduced “derivatives” is suitable for 
the purpose. First, the Lie derivative £, actually does not have the properties we 
would expect from a derivative in that it is not local in v. The value of £,T for a 
given tensor field T at x € M depends not only on v(x) but also on its derivatives, as 
is clear from (A.47). Second, the exterior derivative is only defined for differential 
forms and not for other tensors. To have a well-defined notion of derivative for tensor 
fields of arbitrary type, we need additional structure on the manifold. 

In order to see what the required structure might be, let us consider the simplest 
case of a test vector field u. We might think of first finding its components u^ in a 
chosen local frame e 4 and then taking their directional derivative v[u“] with respect 
to a fixed vector field v. That is however problematic. The choice of frame e4 is 
arbitrary and can be changed locally. This is a smoking gun of what in field theory 
is called “gauge freedom.” The directional derivative of u must be a well-defined 
vector field independent of such a “choice of gauge.” This is not the case of v[u“Je 4, 
so v[u^] cannot be the components of a well-defined vector field. Following the 
analogy with physics, the problem can be fixed by introducing a gauge field. 


A.5.1  Covariant Derivative 


Let us now formalize the above observation. The basis e4(x) of TyM can be 
changed by any nonsingular linear transformation that varies (smoothly) with x. 
Upon such a transformation, the basis vectors and components of a tangent vector 
u change to 


éa=en(P')?,, w= Pyu’, P € GL(n). (A.60) 


The group GL(n) is our gauge group; in differential geometry, the term structure 
group is common. Any geometrically well-defined object on the manifold M must 
be invariant under gauge transformations of the type (A.60). It is known from field 
theory that in order to have a well-defined notion of derivative of fields, one needs a 
gauge field, or gauge connection. In differential geometry, this object is called affine 
connection. In a given frame, it constitutes a collection of 1-forms Q4 g- Treating 
these as the matrix elements of a matrix-valued 1-form Q, the action of the basis 
transformation (A.60) on the affine connection can be defined compactly as 


Q= PQP! + PaP! , P € GL(n). (A.61) 


This is analogous to a (non-Abelian) gauge transformation familiar from field 
theory. 
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The collection of 1-forms 4 pg carries one vector and one covector frame 
index. In spite of the suggestive notation, the object e4 @ e*? @ Q4 p is not 
a well-defined tensor field of type (1, 2) on M. This is a consequence of the 
second term in (A.61). The affine connection is therefore inseparably tied to 
the chosen local frame. 


Following the analogy with field theory, we now define a directional covariant 
derivative Vu of the vector field u along v component-wise as 


(Vpu)4 = vfu4] + 24, (v)u8 . (A.62) 


It is easy to check that the components (V,w)“ transform under the local change 
of basis (A.60) just like u^. Hence the directional covariant derivative Vyu = 
(Vyu)“e, is a well-defined vector field on M independent of the choice of frame. 
Moreover, V,u is manifestly linear and local in v. We can therefore separate v out 
and only keep a tensor object that encodes the variation of u with respect to the local 
frame. This is done by setting (Vvu) = vË (Ve,u)4 = vë (Vgu)4, where 


(Vgu)* = eplu“]+T acu" , Tec = 24c(en) = (Vaec)’ , (A.63) 


and re. are the Christoffel symbols. These are independent of u and characterize 
the geometry of the manifold and of the local frame. 

The field theory analogy allows us to write down at once the covariant derivative 
of a tensor field of any type (q, p). Different types of tensors transform under 
different representations of the structure group, induced by the fundamental rep- 
resentation (A.60) acting on vectors. We can thus reuse (A.62) by inserting the 
appropriate representation of the affine connection. For instance, tensor fields of 
type (0, 0), that is functions on M, are blind to the change of local frame. This 
corresponds to the trivial representation of GL(n). The covariant derivative then 
reduces to V, f = v[ f]. For a generic tensor T of type (q, p) we find 


(Vine. jos 2 TS + P re a es 
— THB Q ev) TAE Ql w). 


This can also be rewritten in a manner independent of the vector field v, following 


the example of (A.63), 


(Velie. = ec[T48] + TA TB + TB, TA + Ea 
AB pl AB pl ` 
= TiL Tek ~ lect 
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The natural choice of e, in R” is the coordinate basis in the Cartesian coordinates. 
In this basis, we can set Q = O so that Vu = v[uį]ea. This expresses the 
fact known from elementary vector calculus that e4 is a globally well-defined, 
constant basis of vectors. Switching to another local frame @,4 via (A.60) induces a 
nontrivial connection, Q = PdP~!. This characterizes the “variation” of the new 
frame throughout R”. A short manipulation leads to an expression for the Christoffel 
symbols in terms of the derivatives of P, 


Pac = —(dPP7')4C pg) =(P)" eal Pg] 
e 1\D i A g ka KE A (A66) 
=-(P_) BP ) ceDlP g] =—-(P™) BP ) copP E: 


Equations (A.63) and (A.65) then allow us to calculate derivatives of vectors and 
tensors in R” in terms of their components in arbitrary curvilinear coordinates. 


A.5.2 Curvature and Torsion 


We can pursue the analogy with field theory even further. Therein, one uses the 
gauge connection to construct the corresponding field strength which encodes 
information about the gauge field in a gauge-covariant fashion. In the geometric 
language, this leads to the curvature 2-form on the manifold M, 


RA, = dO", + 04 GA 08 5 (A.67) 


The matrix elements R4 p can be put together into a single object, R = e4 & eE @ 
R4 g- Unlike the matrix-valued connection 1-form &, this is a well-defined tensor 
field on M, of type (1, 3), independent of the choice of local frame. It therefore 
carries information about the intrinsic geometry of the manifold. The curvature 2- 
form is related to the commutator of directional covariant derivatives via 


(Vu Vow — VVuw — Viu. 1w)" = R^g(u, vw? , (A.68) 


where u, v and w are vector fields. This identity follows directly from the defini- 
tion (A.62) of covariant derivative upon using the first line of (A.50). Using (A.64), it 
can moreover be immediately generalized to the commutator of covariant derivatives 
of an arbitrary tensor field T, 


AB: nae oa 
(Vu; Vol? — Viu oT) kg.. = R^ u, wT + RB u,v) TR + (A.69) 


— TAB R! (u,v) — T4 RE (u, v) — 
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The affine connection also gives rise to another well-defined tensor field that 
characterizes the geometry of the manifold. Namely, the elements of the dual 
coframe e*“ transform under (A.60) identically to the components of a vector. We 
can then define a covariant exterior derivative of the coframe in parallel with (A.62), 


Petra phe. (A.70) 


This is dubbed the torsion 2-form. It is a collection of 2-forms whose components 
transform under the structure group as a vector. Hence T = e4 Q T4 is a 
well-defined tensor of type (1, 2). The torsion 2-form satisfies an identity similar 
to (A.68), 


(Viv — Vou —[u, v])* = T^ (u, v) . (A.71) 


Owing to the fact that do d = 0, one may expect to find constraints on the 
curvature and torsion 2-forms by taking another (covariant) exterior derivative. 
Indeed, a short manipulation based on the definitions (A.67) and (A.70) leads to 
the so-called Bianchi identities, 


dR4A, +c A^R pg — RAG AQ, = 0, 
(A.72) 
d74 +04, AT? = RA, ret . 


Using the MC form introduced in Example A.11, we can get insight into the 
geometry of Lie groups. The simplest way to introduce an affine connection on 
a Lie group G is to set 24, = 0 in the coframe œf defined by (A.58). The 
definition (A.67) then tells us that the curvature 2-form vanishes. The torsion 2- 
form does not, though. From the definition (A.70) and the MC equation (A.59), we 
get that T4 = (1/2) f fco? A œf, where f SE are the structure constants of G. 

Making © trivial is however not the only choice of connection we can make. 
Given the structure that we have available on the Lie group G, we can try for 
instance 


LS 4 (A.73) 


where A € R is a parameter. This class of connections includes the previous trivial 
case as A = 0. Using the MC equation (A.59), one now finds that 


À 
Rip = (Fiteteo + the fo) o Aw”, 
(A.74) 


1 
APA = (5 -2) ae Aw. 
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If we do not impose any constraints on the structure constants f ine we can generally 
make the curvature 2-form vanish only by returning to the special case A = 0. On 
the other hand, we can make the torsion 2-form vanish by setting à = 1/2. We can 
however not make both 2-forms vanish simultaneously. A generic non-Abelian Lie 
group therefore possesses nontrivial geometry as a manifold. 


Our original motivation for introducing the affine connection was to take account 
of the freedom to choose a local frame at will. This is largely responsible for the 
fact that I have phrased this section entirely in a coordinate-free language so far. 
It is however possible to reformulate the definitions and properties of covariant 
derivative, curvature and torsion in terms of local coordinates. This leads to minor 
simplification of some of the equations in this section. Thus, the action of the basis 
vector eç in (A.65) becomes a mere partial derivative and (A.65) turns into 

(VeT IEP = OTE + DATI + TETH + 


abe pi abe pi CAT) 
=) kee Tas 


The Christoffel symbol herein is defined by a coordinate-basis equivalent of (A.63), 
The = (Vb)? or Vpðe = Theda « (A.76) 


Together with (A.70), this shows that the torsion tensor corresponds to the antisym- 
metric part of the Christoffel symbol in a coordinate basis, 


rE — T4, SV — Vode)? = T" (ðp, de) = TE . (A.77) 


Finally, the left-hand side of the identity (A.68) for the curvature tensor reduces to 
a mere commutator of covariant derivatives, hence 


(Vp Vet — VeVpu)* = R47, (8p, du? = R jpet” . (A.78) 


Likewise, (A.69) gives the commutator of covariant derivatives of an arbitrary tensor 
field in the coordinate basis, 


(VeVaT = VaVe T) SRO eR ae 
‘ f ‘ ; (A.79) 
a Rpa Tie eg 
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A.6 Riemannian Geometry 


My exposition of differential geometry has so far been largely ahistorical. I made a 
conscious effort to only introduce the structure that was strictly necessary in order to 
develop the elementary concepts of calculus on manifolds. Thus, it only requires the 
basic differential structure on a manifold to define tensor fields, their Lie derivative, 
and the exterior calculus of differential forms. The first time that a new, additional 
structure was needed was in the previous section. There, we saw how the concept of 
affine connection naturally arises from the need to compute (covariant) derivatives 
of tensor fields. 

However, as the Greek origin of the word “geometry” suggests, much of the 
early progress in the subject was driven by the practical problem of measuring 
distance. The concept of distance is very natural when the manifold in question 
represents space or spacetime in physics. Yet, even more abstract realizations of 
manifolds often possess a quantitative notion of closeness that is more refined than 
mere topology. The purpose of this section is to introduce the structure needed to 
measure distance on a manifold. Despite being anchored in geometry, we shall see 
that this will also give us further useful tools for our quest to develop calculus on 
manifolds. 


A.6.1 Riemannian Metric 


Since I have already mentioned the concept of metric before, let me start right away 
with the formal definition. A Riemannian metric g on a manifold M is a tensor 
field of type (0, 2) such that for any x € M, g(x) is a symmetric, positive-definite 
bilinear form on TyM. See Examples A.4 and A.5. A manifold endowed with a 
Riemannian metric is called a Riemannian manifold. It is common to represent the 
metric by its components in a given coframe or dual coordinate basis, 


g = gape Q e? = gandx" @ dx” , (A.80) 


where gap = g(e@4,ep) and gap = g(0q, 0p). A Riemannian metric gives by 
construction rise to an inner product, both locally of tangent vectors and of vector 
fields. It is practically convenient to have a shorthand notation for this inner product, 


(u, v) = g(u, v) = gagu’ v’ , (A.81) 


where u, v are arbitrary vector fields. 

For some applications, the axioms for Riemannian metric are too restrictive. 
By removing the requirement of positive-definiteness and insisting merely that g 
as a bilinear form is nondegenerate, one obtains a generalization called pseudo- 
Riemannian metric. Likewise, manifolds endowed with a pseudo-Riemannian 
metric are known as pseudo-Riemannian manifolds. This broader concept finds 
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an important application in the general theory of relativity, which assumes the 
physical spacetime to be a Lorentzian manifold. The latter possesses by definition 
a metric with signature (—, +, +,...) or (+, —, —, ...), depending on the con- 
vention adopted. Most of the contents of this section apply, with occasional minor 
adjustments, to Riemannian and pseudo-Riemannian manifolds alike. However, 
within the context of the book, pseudo-Riemannian manifolds play a little role. I will 
therefore content myself with this remark of caution and focus solely on Riemannian 
manifolds in the following. 

Before illustrating the utility of the metric, it is worthwhile to contemplate its 
very definition. How is this actually related to the distance of two different points 
on the manifold? First, the metric by definition acts point-wise on tangent vectors. 
Second, the tangent vectors themselves were defined as differential operators on 
local test functions. To connect the abstract formalism to the down-to-earth problem 
of measuring distance, it helps to recall the concept of flow generated by the vector 
field v, introduced in Sect. A.3.2. One can interpret (v, v) as the squared length 
of the vector tangent to the curve (trajectory) y(t) defined by (A.30). In the liquid 
analogy of the flow generated by v, ./(v, v} becomes the speed of the liquid particle. 
The distance of points x1,2 = y(t1,2) measured along the curve y (t) is then naturally 
defined by integration, 


t2 

dy (x1, x2) =f Vew dt . (A.82) 
ti 

This distance is independent of the choice of parameterization of the trajectory, that 

is of the speed at which the trajectory is traversed by the liquid particle. It is therefore 

an intrinsic property of the curve y as a set of points on M. 

Now that we know how to measure the length of a curve on M, we define the 
distance of points xı, x2 € M as the minimum possible value of d, (x1, x2). This is 
a problem in variational calculus that leads to a second-order differential equation 
for y (t) (see Sect. 11.5.1 of [1]), 


dy4 oe dy? dy! _ 


+4 A.83 
dr? be dt dt aie 
Here the set of coefficients ib ç İs determined by the Riemannian metric, 

^a 1 ad 

Pez 38 (Op8dc + Ic8bd — d8bc) » (A.84) 


where ge are the matrix elements of the inverse of gap. Equation (A.83) is known as 
the geodesic equation. The resemblance between its left-hand side and the covariant 
derivative Vv with v? = dy“/dt is not accidental. The coefficients (A.84) are 
the Christoffel symbols of a special connection on the Riemannian manifold M, 
called the Levi-Civita (LC) connection. The fact that i is symmetric in its lower 
indices implies by means of (A.77) that the LC connection has vanishing torsion. A 
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given Riemannian manifold can support many different affine connections. The LC 
connection is however unique in that it is torsion-free and makes the metric of the 
manifold covariantly constant (see Sect. 7.4 of [2]), 


Vea = Oc Bab = gabli, = eagl =0. (A.85) 


Example A.14 


The origin of the LC connection can be understood in elementary terms. Imagine 
that the manifold M is smoothly embedded in R” for some n > dim M. We can 
then rely on the geometric intuition whereby tangent vectors to M are “realized” as 
vectors in R” by the embedding f : M — R”, as in Example A.6. I will take the 
liberty to identify tangent vectors to M with their images under the push-forward 
A 

Suppose we want to define the derivative of a vector field u on M along the vector 
field v. To do so, we need to compare u(x) at a given point x € M with u(x + €v), 
where the shorthand notation x + ev indicates the point on M with coordinates 
x" + ev" (x). We cannot do this directly since the tangent spaces to M at x and 
x + €v are different. What we will do instead is take the vector u(x), transport it as 
a constant vector in R” to x + ev and then project it back to the hypersurface M. 
See Fig. A.4 for an illustration of the procedure. 

It is easiest to use a frame e4 on M that is orthonormal with respect to the 
Euclidean metric in R”. The projection to M is then accomplished by taking the 
inner product of u(x) and e4(x + ev). This leads to a definition of covariant 
derivative on M as 


A 
(V,u)4Q) = lim u(x +€v) — eae + ev), u(x)) (A.86) 


Comparing this to (A.63) allows us to identify the corresponding Christoffel 
symbols, 


Pac = —(dpea, ec) = (ea, Opec) , (A.87) 


where ðge 4 (x) is defined as lime_,o[e4(x+€eg)—e 4 (x)]/e. We have thus managed 
to define an induced affine connection on M in a way that encodes the variation of 
the orthonormal frame e4 as a set of vectors in R”. It is an easy exercise to check 
using (A.71) that this affine connection is torsion-free. Moreover, with the induced 
metric on M given by gap = (e4, eg) = ĉap, one readily checks using (A.65) that 
the connection (A.87) makes the metric covariantly constant. Using the embedding 
in R”, we have therefore found the (unique) LC connection on M. 

As a side remark, recall that the comparison of tangent vectors at different points 
on M was made possible by “transporting” them as constant vectors in R”. This 
intuition lies behind an often-used geometric picture of affine connection in terms of 
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Fig. A.4 Comparison of tangent vectors to M at different points P and P’ is possible if the 
manifold is embedded in R”; here shown for simplicity for one-dimensional M embedded in R?. 
A tangent vector at P € M, up, is transported to P’ as a constant vector in R?. Subsequently, 
it is projected to the tangent space to M at P’, resulting in the vector u pp’. This can now be 
compared to any other tangent vector at P’. The distance of points P and P’ is exaggerated for 
clarity; for the definition of covariant derivative, it should actually be infinitesimal 


parallel transport. In general, a tensor field T is said to be parallel-transported along 
the vector field v if VT = 0. The geodesic equation (A.83) then describes curves 
whose tangent vector is parallel-transported along the curve. The solutions of (A.83) 
thus have the dual interpretation as the shortest (or more precisely extremal) or the 
“straightest” paths on M. See for example Sect. 7.2 of [2] for more details. 


The metric obviously provides us with a crucial piece of information about the 
intrinsic geometry of a Riemannian manifold. It is nevertheless also invaluable for 
the calculus on manifolds, as it allows us to relate tensors of different types. It is 
known from elementary linear algebra that a vector space such as 7, M for fixed 
x € M and its dual, 7; M, have the same dimension. As such, they are necessarily 
isomorphic. There is, however, no natural way to set up the isomorphism without 
first fixing a basis in the two spaces. Here the metric comes to rescue. It allows 
us to define two mutually inverse musical isomorphisms, b : TyM —> T¥M and 
1: TŽM — TM, and by extension isomorphisms between vector fields and 1- 
forms. Thus, any vector field v is mapped to its flat v’, which is a 1-form such that 


v (u) = (v,u) for any vector field u . (A.88) 


406 A Elements of Differential Geometry 


Vice versa, any differential 1-form w is mapped to its sharp w*, which is a vector 
field such that 


lot, v) =w(v) for any vector field v . (A.89) 


Most theoretical physicists are familiar with these maps in their component forms, 


b B A AB 
V4 = SABY , ott =g og. (A.90) 
These correspond to the usual lowering and raising of indices by the metric and 
its inverse. The generalization of these operations to tensors of other types is 


straightforward and I will not dwell on details. 


It is common in the physics literature to denote a tensor and its image under 
b or ġ by the same symbols, and only distinguish them by the positions of 
their indices. In a restricted sense, it can even be meaningful to identify a 
vector with its image under b or a 1-form with its image under #. Suppose 
that we choose an orthonormal frame on M, that is a frame e, such that 
(ea,eB) = d,p. Then it follows from (A.88) that b maps this basis to the 
corresponding dual basis, e = e*4, The same correspondence applies to 
the inverse isomorphism, (e*4)* = e4. By linearity, the components of any 
vector then remain unaffected by b, and the components of any 1-form remain 
unaffected by {. 


A.6.2 Isometries of Riemannian Metric 


The whole book is devoted to symmetries and their consequences in quantum field 
theory. But what do we mean by a symmetry in case of a manifold? I already 
pointed out in Sect. A.3.2 that a group G can act on a manifold M via a set of 
transformations (diffeomorphisms) on M. However, not every transformation on M 
is asymmetry. For that, the transformation should in some sense respect the structure 
of the manifold. The most common type of symmetry one meets in differential 
geometry is one that preserves the Riemannian metric of the manifold. 

Formally, a diffeomorphism f : M — M is called an isometry if it leaves the 
Riemannian metric g on M invariant, f*g = g. Equivalently, one may require that 
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the inner product of test vector fields u, v is unaffected by push-forward by f,* 
(Fatt, Fev) ec) =(u,v)|. foranyxe M. (A.91) 


The set of isometries of a Riemannian manifold forms a group G under map 
composition. This group may be both discrete, or even trivial, and continuous. It is 
however generally not easy to determine the isometry group G of a given manifold 
M directly, unless one has some additional insight into the structure of M. 

The task is much easier when G is a Lie group. We can then focus on symmetries 
of the metric under infinitesimal transformations. In line with the discussion in 
Sect. A.3.3, this amounts to solving the condition Ls g = 0, where € is a vector field 
representing a generator of the isometry group on M. This is known as the Killing 
vector field, or more briefly Killing vector. The condition on & is equivalent to a set 
of homogeneous linear partial differential equations for the components of £. These 
are usually much easier to solve than (A.91). In practice, it is often convenient to use 
an alternative formulation of the invariance condition, based on the LC connection 
instead of the Lie derivative, 


Leg=0 © (Va8?)y + (CE) = 0. (A.92) 


One then first solves for the 1-form ¿”° and subsequently recovers the Killing vector 
field £ by acting on £” with ț. 


Example A.15 


For illustration, let us find the continuous isometries of the Euclidean metric, g = 
Sdabdx? @dx?, in R”. Dropping for simplicity the superscript b on £, and using the fact 
that the LC connection of the Euclidean metric is trivial in Cartesian coordinates, 
(A.92) becomes 


dab + béa = 0. (A.93) 


Hence ap must be antisymmetric in its two indices. Using this repeatedly together 
with the symmetry of second partial derivatives, we infer 


Oa 0p&e = —3aðcEb = IpOcEa = — 3p ðaEc = —Iadpéc . (A.94) 


It follows that all second partial derivatives of £, must vanish. This conclusion has a 
neat generalization to an arbitrary Riemannian manifold. Following essentially the 


4 This definition can at once be generalized to a local isometry, which is a smooth but not 
necessarily invertible map between two different manifolds, f : M —> N, satisfying (A.91). 
(The inner products on the two sides of the equation are then defined with the respective metrics 
on M and N.) In this case, it is of course not appropriate to speak of a symmetry of either M or N. 
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same reasoning, one finds that 

VaVbic = aR? 4, OF VaVobo = R pagë" » (A.95) 
where Rs, is the curvature tensor of the LC connection. The outer covariant 
derivative in (A.95) is defined by treating V,& as a rank-2 tensor. 


Together with (A.93), the vanishing of second partial derivatives of & implies 
the most general solution of the Killing equation (A.92) for the Euclidean metric, 


Eq(x) = Qa + Bapx? . (A.96) 


Here a, is a set of constants and Bap a constant antisymmetric matrix. This 
corresponds to a combination of infinitesimal translations and rotations, which 
together generate the Euclidean group ISO(n). 


The Killing equation only gives us information about the Lie algebra of the 
isometry group G. The reconstruction of the whole group thus suffers from the usual 
ambiguities of the correspondence between Lie groups and Lie algebras. In addition, 
(A.92) is based on local information about the metric structure of the manifold M. 
The global topology of M may select only a subalgebra of solutions to (A.92). The 
following example, adapted from Sect. 4.6 of [3], illustrates this subtlety. 


Example A.16 


The set T = {(c1,...,Cn)|Ca E€ Z} defines a lattice of points in R” with 
integer coordinates. The quotient set R” /T is the n-dimensional torus, T”. In local 
Cartesian coordinates, the Euclidean metric on T” takes the same form as in R”, 
g = 54pdx% @ dx’. The Killing equation (A.92) therefore produces the same Lie 
algebra of local solutions as in Example A.15. However, not all the solutions (A.96) 
are well-defined 1-forms on T”. In fact, the matrix ap has to vanish; the torus 
does not have continuous rotation symmetries. Moreover, the continuous translation 
symmetry of T” differs from that of R”. It is U(1)*” for the torus but R” for the 
Euclidean space. In addition, the torus has a discrete group of rotation isometries, 
which corresponds to the point group of the lattice I. Such discrete isometries are 
of course not captured by the Killing equation (A.92) at all. 


A.6.3 Symmetries of Curvature Tensor 


Before developing further the basics of Riemannian geometry, I will digress to put 
together some useful properties of the curvature tensor Riro p- This is defined 
in any local frame by RS (u, v) = Ri popu v”, where u, v are arbitrary vector 
fields. The first property that follows immediately from the definition of R4 pasa 
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2-form is the antisymmetry, 
R4,(u, v) = —R4,(0,u) or RAgcp = —R4gpc - (A.97) 


This is valid for the curvature tensor based on any connection on the manifold. 

Next comes a pair of relations that require the connection to be torsion-free. 
However, it does not have to be metric-compatible, nor does the manifold have to be 
endowed with a metric to start with. For vanishing torsion, the second of the Bianchi 
identities (A.72) reduces to R^ B^ e*8 = 0. This translates to 


RA, (u, v)w? + R4Ag(v, wu? + R^g(w, u)v? =0, 
y i A 7 (A.98) 
R” gcp + R cop +R pac = 9; 


which is incidentally also known as the (first or algebraic) Bianchi identity. 
Likewise, the first line of (A.72) leads to 


(Vu R) 3 (v, w) + (VyR)4 p(w, u) + (Vu R) pu, v) = 0, 
(A.99) 
(VER) gcp + (VcR)įgpe + (VDR) BEC =0, 


known as the second or differential Bianchi identity. To prove this in a coordinate- 
free fashion requires certain amount of manipulation combining (A.72) with (A.50) 
and the identity V,v — Vyu = [u,v], which follows from (A.71) for vanishing 
torsion. 

In presence of a Riemannian metric, the curvature tensor of the LC connection 
(the Riemann curvature tensor) has further properties that make it particularly 
symmetric. The starting point for uncovering these properties is the relation 


A A 


u[(v, w)] = Vu (v, w) = (Vuv, w) + (v, VW) ; (A.100) 


This follows at once from the Leibniz rule for the covariant derivative and the fact 
that the LC connection is metric-compatible. By a repeated application of (A.100), 
it is straightforward to show using (A.68) that 


(z, Ru, v)w) = — (w, R(u, v)z) . (A.101) 
Here u, v, w, z are arbitrary vector fields and R(u, v)w is a shorthand notation for 


ea R^ glu, v)w®. Upon lowering the first index of the curvature tensor with the 
metric, (A.101) acquires a particularly simple component form, 


Raspcp = —RBACD - (A.102) 
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Finally, combining (A.102) with the algebraic Bianchi identity (A.98) shows that 
the type-(0, 4) tensor Ragcp is symmetric under swapping its two pairs of indices, 


(z, Ru, v)w) = (v, R(w,z)u) or Ragcp = RCDAB. (A.103) 


A.6.4 Geodesic Normal Coordinates 


While the general language of differential geometry is largely coordinate-free, it is 
often convenient to resort to specific coordinates tailored to the problem at hand. For 
instance, in Euclidean space, the global Cartesian coordinates play a privileged role. 
On a general Riemannian manifold, one can similarly introduce coordinates x“ that 
are “locally Cartesian” (see Chap. 5 of [7] for details). These are defined around an 
arbitrarily chosen point x9 € M and have the following properties. First, the point 
xo itself is mapped to the origin, that is x7 = 0. Second, the Riemannian metric 
at xo reduces to the standard Euclidean metric, gap(x0) = dap. (Any other choice 
amounts to a mere change of basis of 7% M.) Third, all the first derivatives 3egap of 
the metric, and thus also the Christoffel symbols Î b o of the LC connection, vanish 
at xo. Finally, geodesics, that is solutions to (A.83), passing through xo are straight 
lines y“(t) = v“t with some fixed vector v € Ty, M. 

Such normal coordinates can be constructed explicitly, if somewhat formally, 
in a neighborhood of xo using the last of the properties listed above. One chooses 
arbitrarily a basis eg of the tangent space Tyo M. For any vector v = ve, E€ Tx,M, 
one then denotes as yy the geodesic satisfying the initial conditions y“(0) = 0 and 
dy“ (0)/dt = v*. The normal coordinates of the point y(t) are defined as x° = v“t. 

The normal coordinates make it easy to check some differential-geometric 
identities involving covariant derivatives with respect to the LC connection. For 
instance, (A.47) is immediately seen to be equivalent to 

(Loin = UVTI me + Te Omo) + Te (Vay! po 
Ibe os Like eds xh (A.104) 
= Tan VIE = Tnn VERY Oe a 
Indeed, at any chosen point xo, the normal coordinates make the Christoffel 
symbols vanish so that (A.104) agrees with (A.47). Since both sides of (A.104) 
are well-defined tensors on M, the identity must then hold for any choice of local 
coordinates. 


Example A.17 


For an illustration, consider the Lie derivative of a Riemannian metric, 


(Lv 8)ab = Scb(Vav) + gac(Vpv)° . (A.105) 
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Here the first term in (A.104) drops owing to the fact that the metric is covariantly 
constant. This immediately recovers the equivalence of the two formulations of the 
Killing equation in (A.92). 


Perhaps even more importantly, normal coordinates allow us to systematically 
compute gab(x) as a power series in x“ and the Riemann curvature tensor and its 
covariant derivatives at x9. Up to fourth order in the coordinates, the series reads 


1 A 
8ab(X) = gab (0) — > Cxi Racha (0) — o Mea (A.106) 
1 
S rie rae V ¢R)acba (0) + = x1 xx! Racgd (0) R* pp (0) ekg 


where I lowered the first index of R°, eq USing the metric. The derivation of (A.106) 
is technical and I will therefore skip details. An interested reader will find a 
justification and a compilation of further results in an accessible form in [8]. 

A locally symmetric space is a special type of a Riemannian manifold for which 
the first covariant derivative of the Riemann curvature tensor (hence also all higher 
covariant derivatives) vanishes. See Chap. 10, in particular Theorem 10.19, of [7]. 
For such manifolds, gagp(x) can be expressed in a closed form in terms of the 
curvature tensor at x9. Again skipping details, the final result reads 


sin | R(x) : (= DEH ĝ ia 
Bap) = Bac(O) | Bo |= OD Gree Gap Ret, » 
b 


(A.107) 


where R4, (x) = R%,,(O)x°x?. 


A.6.5 Hodge Star 


Using the # isomorphism, the inner product induced by the metric can be extended 
from vector fields to 1-forms. Thus, for any two 1-forms w and o, we set 


(o,0) = lat, oË) = gagot? = g^ wag (A.108) 


This in turn extends to two differential forms w, o of an arbitrary (equal) degree p, 


A, By ApBp 


1 
(0,0) = pe g OA, --ApOBi Bp - (A.109) 


To see what this inner product of differential forms might be good for, we first 
pick an orthonormal frame e 4. Then we define a new top-dimensional form (n-form) 


412 A Elements of Differential Geometry 


in terms of the corresponding dual coframe, 


vol Se" Asn, (A.110) 
For any other local frame, e4 = ene)? 5 with P e GL(n), we get 
vol(@},...,@,) = det P~!. This corresponds to the (oriented) volume of the 


parallelepiped defined by the vectors @4. The n-form (A.110) is known as the 
volume form, hence the notation “vol.” Provided that the two local frames have the 
same orientation, that is det P is positive, the volume form can be expressed in the 
new dual coframe é*4 as 


vol = det Pe! A... AO" = SZ A. AE. (A.111) 


Here g is the determinant of the matrix representation of the metric in the new 
coframe, g4B = (P7!7 P—!) ap. 


The frame e4 is in general defined only locally. We thus need to apply the 
definition (A.110) to an atlas of open sets that covers the manifold M. In 
order that the volume form is well-defined and nonvanishing on the whole of 
M, we have to make sure that local frames on mutually overlapping charts 
have the same orientation. The existence of such an atlas and a set of equally 
oriented local frames is the defining property of an orientable manifold. 


I now get to the main idea of this subsection. At any point x € M, the spaces 
Q2 M and Q% P M of p-forms and (n — p)-forms have the same dimension, (p) 
and thus are isomorphic. In order to define a natural isomorphism between these 
spaces, and by extension between differential p-forms and (n — p)-forms, we need 
both the inner product (A.109) and the volume form (A.110). A given p-form w is 
then mapped to the unique (n — p)-form xœ such that 


o A (*@) = (0, w) vol (A.112) 


for any test p-form o. The inner product is needed to make (A.112) linear in both w 
and o. Likewise, the volume form serves as a reference n-form to compare o ^ (x œw) 
to. The linear operator x is known as the Hodge star, and xw is referred to as the 
Hodge dual of w. In the special case of p = 0, (A.112) gives a simple coordinate- 
free definition of the volume form, vol = x 1. 

The definition (A.112) is coordinate-free and elegant, but also not very transpar- 
ent. For practical purposes, it is almost always more convenient to use instead the 
component expression for the Hodge dual, 


V8 gAtBi... gApB 


*o = —Y° _ gr Brag A EBB, eP A--»Ae* en | (A113) 
p\(n — p)! P 
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Here €4,...4, is the fully antisymmetric LC symbol, equal to +1 when Aj,..., An 
is an even permutation of 1,...,”, to —1 when Aj,..., An is an odd permutation 
of 1,...,, and to 0 otherwise. It is not to be confused with the Levi-Civita tensor, 
/8 €A, A,» Which by (A.111) gives the components of the volume form. 

It is clear that by applying the Hodge star x twice, we map a p-form w to another 
p-form, x» w. What is less clear but easy to check is that the two forms are equal 
up to a sign,’ 


xxo = (-1)?"-)@, or equivalently «7! w= (-1)?"") xw.  (A.114) 


Being an isomorphism on the Grassmann algebra of differential forms, the Hodge 
star can be used to conjugate other linear maps defined on forms. The most important 
example of such a conjugate operator is the codifferential 5. This is defined on p- 
forms by conjugating d with an extra conventional minus sign, 


bor (=$1) x7! de œ = (PH y dxo. (A.115) 


It follows from the conjugation and from (A.52) that 8 o è = 0. However, unlike d 
itself, the codifferential is not an antiderivation of the Grassmann algebra. 

It is possible to conjugate in the same way the interior product (A.17). This 
does not lead to a new interesting operator though. Indeed, combining (A.17) 
with (A.113), it is straightforward to check that for any p-form @ and vector field v, 


(C(I uros. (A.116) 


Finally, one can similarly conjugate the Lie derivative £,. Thanks to the Cartan 
magic formula (A.57), the result is fixed by the conjugation of d and ty. It is however 
not particularly illuminating and I will therefore not show further details. 

Let us now get back to the codifferential. This together with d furnishes a 
powerful toolkit that generalizes much of the differential structure familiar from 
ordinary vector calculus. The exterior derivative itself generalizes the notion of 
a gradient of scalar fields in R” and curl of vector fields in R*, as noted in 
Example A.10. This leads to a far-reaching generalization of integral theorems in 
vector calculus that I will address in Sect. A.7. In R” we also have the divergence 
operator that turns a vector field into a scalar. We can construct such a map on any 
orientable manifold M by combining the flat b and the codifferential. Thus, for any 
vector field v, 


1 


ges vi) = (Vav)" , (A.117) 


div v = —8v? = 


5 Here and occasionally in the following, I drop the o symbol indicating composition of operators 
to prevent the equations from becoming awkward. 
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where g now stands for the determinant of the matrix representation of the 
metric in the dual coordinate basis dx“. In the last expression in (A.117), the LC 
connection (A.84) is implicitly assumed. The next-to-last expression in (A.117) is a 
special case of the component form of the codifferential of a p-form, 


os 
V8 


This follows directly from (A.113) and (A.115) upon some manipulation. Last but 
not least, there is another, rather elegant way of computing the divergence of a 
vector field through (div v) vol = £» vol. This is an infinitesimal version of the 
familiar fact from multivariate calculus that upon a transformation of coordinates, 
the volume measure in an integral changes by the Jacobian of the transformation. 


The component expression (A.118) makes it easy to see why the codifferential is not 
an antiderivation of the Grassmann algebra. Should 6 satisfy an identity analogous 
to (A.55), we would have, for instance, for any 0-form f and p-form œw that 


Boara, = db, [VE e] gap, Bashy - (A.118) 


MfAw) =8frotfAso=faso. (A.119) 


Here I used the fact that èf = 0 for any 0-form f, which is a consequence of the 
fact that d vanishes on any n-form. But (A.119) cannot be correct, since (A.118) 
makes it clear that 8( f A œw) must depend on both f and its derivatives. 


The Noether current of a continuous symmetry of a physical system is a divergence- 
free vector field J, that is 8J? = 0. The latter condition is naturally generalized 
to 8J = 0 for any “p-form current” J. Such a generalized conservation law is 
via Noether’s theorem associated with a so-called (p — 1)-form symmetry [9]. 
Additional insight follows from writing the conservation condition 8J = 0 
equivalently as dx J = 0. Namely, if J is a conserved p-form current and K a 
conserved q-form current, then the exterior derivative of the (2n — p — q)-form 
xJ A *K vanishes thanks to (A.55). Hence x(x J A x K) is a (p + q —n)-form 
current that is also conserved. In this way, we can use already known generalized 
conservation laws to construct new ones as long as p +q > n [10]. 


Finally, by combining d and 8, we can construct a “second-order differential 
operator” which maps p-forms to p-forms. While it would in principle be possible 
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to consider separately d o è and 6 o d, it is their sum that is of special interest, 
A= (d+8)? =do8+58o0d. (A.120) 


It follows at once from (A.117) that on 0-forms this reduces to 


1 — 
Af =-—4, aba, f) = —g9V Vif , A.121 
f (Ves Hf) =-8 oF ( ) 


where the second expression again requires the LC connection. Thus, A is a gen- 
eralization of the Laplace operator to differential forms on orientable Riemannian 
manifolds. It is known as the Laplace—de Rham operator, or alternatively as the 
Hodge Laplacian. 


A.7 Integration on Manifolds 


So far I have discussed solely the differential structure on manifolds and related 
concepts. It is finally time to address the other essential ingredient of calculus: 
integration. Following the analogy with multivariate calculus in R”, one might want 
to integrate functions on a given manifold M. Without the privilege of having 
globally well-defined Cartesian coordinates, it however turns out more natural to 
integrate forms. I will show how to do this in the next subsection. The abstract 
notion of integration of forms can nevertheless be bypassed in case the manifold 
possesses a metric structure. It is then possible to sweep the differential form nature 
of integration under the rug. This results into a definition of integration in terms of 
a volume measure determined by the metric, which is the subject of Sect. A.7.2. 


A.7.1  Orientable Manifolds 


A well-defined concept of integration on a manifold must have a geometric meaning 
independent of the choice of coordinates. It makes no sense to just pick random 
coordinates x“ and define an integral of a function f : M — R via something like 


f s= (f og !)(x) d” x (wrong) , (A.122) 
M (M) 


where g : M — R” is the (possibly only locally defined) coordinate map. This 
naive attempt cannot be fixed even by adding an integration measure that would 
correctly account for changes of coordinates. The problem is that without further 
structure on the manifold, there is no unique measure we could use for the purpose. 
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It turns out that differential forms have all the ingredients we need. Let us 
consider an n-form w on an n-dimensional manifold M. In chosen local coordinates, 
it is given by (A.13), 


w = — Owa 
n! 


a dx” A A dx™ = @gndx! A- A dx” . (A.123) 
Under the change of coordinates from x“ to x“(x), the single independent com- 
ponent w1...n(x) transforms by the inverse of the Jacobian of the coordinate 
transformation, &1...n (X) = |dx/dx| -l w]...n(x). It therefore appears that all we have 
to do is to strip off the factor dx! A.-- A dx” in (A.123) and integrate w1...n (x) as 
a function on R”. Upon a change of coordinates, the transformation of w1...n will 
exactly cancel the Jacobian generated by the Euclidean volume measure d” x. 

There are two flies in the ointment though. First, a closer look reveals that 
while @).... indeed changes by the inverse of the Jacobian of the coordinate 
transformation, the Euclidean measure d”x changes by its absolute value. Another 
manifestation of the same problem is that d” x is invariant under any permutation of 
the coordinates, whereas dx! A --- A dx” and thus @...n changes sign under odd 
permutations. Yet in other words, (dx! A---Adx”)(v1,..., vn) returns the oriented 
volume of the parallelepiped defined by the test vectors vj,...,v, in R”. The 
second, related problem is that we rarely have a set of globally defined coordinates 
on the manifold. It is mandatory to ensure that we can consistently sew together 
integrals of w over individual coordinate patches. This requires that the coordinate 
systems in overlapping patches have equal orientation. For that to be possible at all, 
the manifold M must be orientable. Even on orientable manifolds, we still have the 
freedom to choose global orientation. Fixing the orientation requires that we restrict 
coordinate transformations to those with a positive Jacobian. At the end of the day, 
the integral of an n-form will be defined up to a sign, depending on the choice of 
global orientation. 

The problem of how to sew together integrals of w over individual coordinate 
patches requires more careful consideration. The idea is to “triangulate” the 
manifold in a way that each cell of the triangulation lies in a single coordinate 
patch. To implement this strategy, we start by defining what such an elementary 
cell corresponds to in R”. A p-simplex (Po,..., Pp), where 0 < p < n, is the 
convex envelope of (that is the smallest convex set containing) the set of points 
Po, ..., Pp € R”. For the triangulation approach to integration to be successful, 
we next need to be able to put together two or more simplexes. This is done by 
defining a set, Cp, whose elements, called p-chains, are formal linear combinations 
of different p-simplexes. 


The introduction of p-simplexes was motivated by the desire to triangulate a 
given manifold M. It would therefore appear most natural to only consider 


(continued) 
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linear combinations of simplexes with coefficients +1. It will however turn 
out convenient to allow for arbitrary real coefficients, thus making Cp a real 
vector space. The advantage of doing so will become apparent in Sect. A.8 
where I introduce the dual structure to p-chains in terms of differential forms. 


For integration of differential forms, we need to define an orientation of the p- 
simplex. We can do this by identifying (Pz(0), - - - , Prcp)) with (sgn) (Po, ..., Pp) 
for any permutation z on the indices 0,..., p. With this convention, we can now 
also define an oriented boundary of a p-simplex. This is formalized by the boundary 
operator, which is a linear map 0p) : Cp —> Cp-1 such that 


Pp 
3p Pie) = YU Paks Pii, Pitis -o Pp) (A.124) 
i=0 


The fundamental property of the boundary operator is that when applied twice in 
succession, it gives Zero, ðp—1 0 Op = 0. 

Eventually, we would like to construct a triangulation of a manifold M by lifting 
simplexes from R” to M. To that end, it is sufficient to consider just one, standard 
p-simplex sp in R”. This is defined to have Pp at the origin and all the other vertices 
on the positive Cartesian semiaxes at unit distance from the origin. The Cartesian 
coordinates of the vertex P; are thus Pf = ôf. Now we can take any smooth map 
f : Sp > M. Its image, op = f (sp), is called a singular p-simplex on M. By 
extension, a singular p-chain c on M is a formal linear combination, 


c=) cio, = cifi(sp) whereci ER, (A.125) 
i i 


and oi are singular p-simplexes. Finally, the boundary of a singular p-simplex is 
defined by lifting the boundary of sp with the same map f, dpop) = fps” 
After the rather long series of formal steps, we are finally ready to define integra- 
tion on the manifold M. In fact, we can do even more than we originally anticipated, 
namely integrate p-forms on any singular p-chain on M such as (A.125), with 
any 0 < p < n. The task is first reduced to individual singular p-simplexes by 
linearity. Then, the integral is pulled back to R? using the map defining the singular 


6 Here I am tacitly extending the action of f from the standard p-simplex to linear combinations 
of p-simplexes in R” by linearity. 
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p-simplex. At the end of the day, one sets for any p-form w and singular p-chain c 


fos fio. [o=f fros | (f'*@)1...p(x)d?x.  (A.126) 


To achieve the original goal to define integration of forms on manifolds, it remains 
to clarify how to represent an n-dimensional manifold M as a singular n-chain. 
There is obviously considerable freedom in doing so, but to identify M with an n- 
chain c = J; cio}, the latter must satisfy several conditions. First, the simplexes o} 
must cover the whole of M without overlaps. Second, each a should lie in a single 
coordinate patch. Third, if the local coordinate system x“ in which ø} is mapped to 
the standard n-simplex s„ is compatible with the global orientation of the manifold 
M, we should set c; = 1. If the orientation is opposite, we have to set c; = —1. In 
practice, all this is easier than it might look, as the following example shows. 


Example A.20 


The spherical coordinates (0, g) on S ? take values from the open rectangle (0, 77) x 
(0, 277) in R? and cover the whole of $? except for one selected meridian. The 
integral of the 2-form w = sin 0 d0 A dg (A.22) then equals 


27 T 
/ w >f a | sin@ do = 47 , (A.127) 
2 0 0 


which is the area of a unit sphere. This example shows that it is not strictly necessary 
to actually triangulate the integration domain. It is sufficient to dissect it into open 
sets, each of which is mapped by a single coordinate chart to a polytope in R”. The 
subsequent triangulation of the polytopes is a formal step that need not be carried 
out explicitly. The example also demonstrates that we can get away with a single 
coordinate chart even if it does not cover the whole manifold. It is sufficient to 
cover it up to a subset of lower dimension. 


With all the necessary definitions out of the way, let me proceed directly to the 
most important result of integral calculus on manifolds: the Stokes theorem. Let c 
be a singular (p + 1)-chain on an orientable manifold M, and let w be a p-form. 


Then 
[ow =} w. (A.128) 
c ðp+1€ 


For a proof, see for instance Sect. 6.1 of [2]. The Stokes theorem puts under the same 
umbrella, and further generalizes, a number of well-known results from ordinary 
calculus and vector calculus. These include the fundamental theorem of calculus in 
R, the gradient theorem for line integrals in R”, the Green theorem for area integrals 
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in R*, the eponymous Stokes theorem for surface integrals in R?, and the Gauss 
theorem for volume integrals in R”. 

There is an intriguing connection between the Stokes theorem and a geometric 
definition of the exterior derivative that I alluded to at the end of Sect. A.4. Namely, 
the exterior derivative of a p-form œw can be defined in a coordinate-free manner 
through integration of w over an “infinitesimal” p-chain that is the boundary of some 
(p+1)-chain. This generalizes the usual definition of the divergence of a vector field 
in R? using its flux through the surface of an infinitesimal volume element. The key 
problem is then to assert the consistency of such a definition; see Sect. 36 of [11] 
for details. In this sense, the exterior derivative can be defined exactly so that the 
Stokes theorem (A.128) holds. Rather than a nontrivial result of integral calculus, 
the theorem may then be viewed as arising from remarkable ingenuity in defining 
the right differential structure on the manifold. 


Example A.21 


In (A.126), the integral of a p-form w is obtained by pulling w back to R?” using the 
map f : Sp —> M defining the singular p-simplex op. This idea can be generalized 
to maps between any two manifolds (see Sect. 7.8 of [3]). Consider two manifolds 
M and N and a smooth map f : M — N. Then, for any p-form w on N and a 
singular p-chain c on M, we have 


J w = fro. (A.129) 
fe) c 


Such integration of differential forms via pull-back is common in field theory. In 
this context, f can be thought of as a field defined on the domain M and taking 
values in the target manifold N. 

Suppose now that M is an n-dimensional manifold without boundary and œw is 
an n-form on N such that dw = 0. Then f mS is a topological invariant, that is, 
it does not change value under a smooth deformation of f. To see why, think of the 
deformation f —> f as a family of functions f, where t € [0, 1] is a deformation 
parameter and fọ = f, fı = f. Formally, f; defines a smooth map from M= 
[0, 1] x M to N. Since M itself does not have boundary, we can view Masa 
cylinder whose “bases” at t = 0 and t = 1 are diffeomorphic to M. Using the 
Stokes theorem, we then find 


pre- from fo fie 


=f _fro= | d( fo) =0. 
Ont41M M 


In the last step, I used the fact that exterior derivative commutes with pull- 
back (A.56) and the assumption that dw = 0. The invariance of f ms @ under 


(A.130) 
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smooth deformations of f can also be understood from the left-hand side of (A.129). 
Here the integrand w is fixed, but the integration domain f(M) is deformed. Upon 
tuning the parameter t from 0 to 1, fı (M) sweeps a singular (n + 1)-chain in N with 
boundaries fo(M) and f|(M). Invoking the Stokes theorem and the assumption 
dw = 0 then again asserts the invariance of J fm @ under smooth deformations of 


f. 


A.7.2 Riemannian Manifolds 


On a first encounter, the differential form approach to integration on manifolds may 
seem a bit abstract. There is however a conceptually simpler alternative in case the 
manifold M is endowed with a Riemannian metric. As explained in Sect. A.6.5, 
the metric induces a natural volume form (A.110) on the manifold. For any smooth 
function f : M —> R, f vol is then a well-defined n-form on M, which can be used 
to define the integral of f over M, 


f s=) fvol= f f(x)V e(x)d”x. (A.131) 
M M M 


The last, coordinate expression follows from (A.111) and is often used in physics to 
perform integration in curved space(time)s. The path leading to this expression can 
also be viewed as a natural consequence of Hodge duality, which maps the 0-form 
f tothe n-form x f = f x 1 = f vol. 


In mathematics, it is common to define a volume form on an n-dimensional 
manifold M as any n-form that is nonvanishing everywhere on M. One then 
proves that the existence of a volume form is equivalent to the requirement 
that M be orientable (see Sect. 6.3 of [3]). The definition of a volume form I 
gave in Sect. A.6.5 is more restrictive in that it is tied to the metric structure 
on M. Such volume form is normalized to unity in an orthonormal frame. It 
can thus be thought of as defining on M locally a generalization of the usual 
volume measure in R”. 


Let me append two remarks of caution for more rigorously-minded readers. First, 
the coordinates may be defined only locally. In such a case, one has to partition 
the manifold and apply (A.131) separately to each coordinate patch. Second, the 
expression (A.131) may appear less general than (A.126), which is well-defined for 
a p-form and a singular p-chain of any degree O < p < n. It is however easy to 
generalize (A.131) to integration of functions on any submanifold N of M. All one 
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has to do is pull the metric g on M back to N using the embedding of N in M. The 
integral over N is then defined as in (A.131) using the induced metric on N. 


Example A.22 


I will illustrate the use of (A.131) by calculating the length of a curve on a manifold. 
The curve is defined by a smooth map y : R — M and we are interested in the 
length of the segment T = {y(t)|t € [t, fo]}. Treating t as a coordinate on the 
curve, we pull the metric g on M back to F via 

a dy? 


ka reds (A.132) 
Y & = Sab ae ae š : 


The length of the segment of interest then follows as 


ty dy4 dy? 
pied peice (A.133) 
T ti 


dt dt 


This agrees with the distance along the curve (A.82), previously postulated ad hoc. 


A.8 Homology and Cohomology 


Most of our discussion of differential geometry so far was restricted to local 
properties of manifolds. However, we have already seen how the global topology 
of the manifold M may place restrictions on otherwise locally defined structures. 
For instance, there may be no globally well-defined frame on M. Also, the global 
topology of M may forbid some solutions of the Killing equation and thus reduce 
the isometry group of M; see Example A.16. 

Topology is a vast area of mathematics and I have no intention to cover even its 
very basics in a self-contained manner. Instead, I will conclude the appendix with a 
brief introduction to some aspects of topology, pertinent to the subject of this book. 
As stressed in Sect. 15.3, spontaneous symmetry breaking is intimately related to 
the presence of defects in the ordered medium. The classification of defects was 
historically the first major application of topology to physics. The classic paper 
by Mermin [12] gives a physicist-friendly introduction to the relevant branch of 
topology, called homotopy theory. See also Chap. 4 of [2] for a somewhat more 
mathematical exposition. In the present book, homotopy theory plays a marginal 
role. A reader that is altogether unfamiliar with the subject will get away with the 
basic definition of homotopy groups, found early on in the above references. What 
I will cover here are elements of de Rham cohomology. This neatly connects the 
global topology of the manifold M to the analytic properties of differential forms 
on M. Cohomology theory plays an important role in the construction of actions 
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for systems with spontaneously broken symmetry. It is therefore highly relevant for 
Parts III and IV of the book. 


A.8.1 Singular Homology 


Let me start somewhat indirectly by recalling the concept of a singular chain, 
introduced in Sect. A.7.1. The real vector spaces of singular p-chains on M, denoted 
from now on more precisely as C,(M, R), form a sequence under the action of the 
boundary operator (A.124), 


ðp+2 ðp+1 


dp ðp— 
c Cp (M, R) > C (M, R) > Cp- (M, R) > +. (A134) 


The spaces Cp(M, R) are infinite-dimensional. It is however possible to identify 
certain finite-dimensional spaces descending from Cp (M, R) that carry information 
about the topology of M. To that end, we first introduce the subspace of p-cycles, 
Zp(M, R) = ker dp, that is those p-chains that have no boundary. Likewise, the 
subspace of p-boundaries, By(M, R) = im ðp+1, consists of those p-chains that are 
boundaries of some (p + 1)-chains. Due to the fundamental property 0, o ðp+1 = 0, 
any p-boundary is a p-cycle. In other words, B,(M, R) is a subspace of Z,(M, R). 
The quotient space, 


Hp (M, R) = Zp(M, R)/Bp(M, R), O<p<n, (A.135) 


is called the p-th homology group of M. Miraculously, this space has a finite dimen- 
sion although both Z,(M, R) and B,(M, R) are infinite-dimensional. Roughly 
speaking, H,(M, R) is a formal vector space spanned on mutually inequivalent 
p-dimensional submanifolds of M that have no boundary and are themselves not 
the boundary of any (p + 1)-dimensional submanifold of M. 


Example A.23 


To see how the homology groups relate to the topology of M, it is good to have a 
look at some very simple examples. First, the homology groups of the 2-sphere are 


HS, RSR, H,(S*,R)~0, HMS, R) XR. (A.136) 


The Ho(S?, R) group reflects the fact that S* is connected. Namely, 0-chains on 8? 
are points on S* and their formal linear combinations. For any two points Pı, Py € 
S?, the formal difference Pı — P2 is the boundary of a 1-chain (curve) connecting 
the points. Hence any 0-chain equals, up to adding a 0-boundary, a multiple of a 
single point on S?. At the same time, any 0-chain is trivially a 0-cycle. It follows 
that Ho(S?, R) is one-dimensional. An obvious extension of the argument shows 
that in general, Ho(M, R) ~ RÉ, where k is the number of connected components 


of M. 
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The triviality of Hı (S 2 R) reflects the fact that there is no 1-cycle (closed curve) 
on S? that would not be the boundary of some 2-chain (area) on $°. Finally, as to 
H>(S*,1R), there is obviously one 2-cycle on $? that is not a 2-boundary, namely 
the sphere itself. The fact that H>(S*, R) ~ R formalizes the observation that any 
2-cycle on S? can be deformed to $? or its multiple. I will add without proof that 
the above results for the homology groups of the 2-sphere readily generalize to the 
n-sphere with any positive n, 


Ho(S",R) > AACS", RVR, H,(S",R)~O foranyO<p<n. 
(A.137) 


Let us look at another simple example, the two-dimensional torus TŻ, for which 
Ho(T?,R)~R, H\(T?,R)~R, Hy(T?,R)~R. (A.138) 


The reasoning behind the zeroth and second homology groups is the same as for 
the 2-sphere. The two independent generators of Hı(T?, R) arise from the two 
possibilities to wind a circle around the torus so that its “inside” does not lie on 
the torus. Just like for the sphere, (A.138) is but a special case of a more general 
statement valid (without proof) for any n-torus with positive n, 


H,(T",R) ~ RO) for anyO < p<n. (A.139) 


As a side remark, the dimensions of the homology groups can be combined into a 
single quantity characterizing the topology of the manifold: the Euler characteristic, 


xM = yop dim Hp(M, R). (A.140) 
p=0 


For the n-sphere and the n-torus, we have x(S”) = 1 + (—1)” and x (T”) = 0. 


The above example provides basic geometric intuition behind the concept of 
singular homology. By no means does it explain how to actually find the homology 
groups for a manifold that is less trivial to visualize than a sphere. The problem how 
to compute homology groups belongs to algebraic topology. In physics-oriented 
literature, one usually encounters the older concept of simplicial homology. This 
makes it possible, although not straightforward, to find the homology groups by 
a “triangulation” of the manifold. An interested reader will find more details in 
Chap. 3 of [2]. I will instead proceed directly to the connection of homology to the 
Grassmann algebra of differential forms on M. 
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A.8.2 De Rham Cohomology 


How can the topological notion of homology be connected to differential forms? 
The basic idea is that any p-form w on M can be thought of as a linear function on 
singular p-chains on M, 


vies cos fo, cEC,(M,k). (A.141) 


c 


Thus, w can be identified with an element of the dual space C “ (M, R) of Cp(M, R). 
Moreover, we can set up a structure on differential forms that is “dual” to (A.134), 


. È QH M É gM EME... , (A.142) 


where Q? M denotes the space of differential p-forms on M. Following the analogy 
with (A.134), we then define the space of p-cocycles, Z?(M) = ker d|or m, as 
consisting of p-forms with vanishing exterior derivative. Such forms are usually 
called closed. Likewise, the space of p-coboundaries, B?(M) = im dļor-1 m, 
consists of those p-forms that equal the exterior derivative of some (p — 1)-form. 
Such forms are called exact. Due to the fundamental property (A.52), every exact 
form is automatically closed, hence B? (M) is a subspace of Z? (M). The quotient 
space, 


H?(M) = Z?(M)/B?(M), O<p<n, (A.143) 


is known as the p-th de Rham cohomology group of M. According to the Poincaré 
lemma, every closed p-form w can be locally written as the exterior derivative of a 
(p—1)-form o, w = do; see Chap. 9 of [3] for a proof and numerous examples. The 
de Rham cohomology classifies obstructions to the global validity of the Poincaré 
lemma. It therefore gives us access to global topological information about the 
manifold M. 

To see the connection between de Rham cohomology and (singular) homology, 
consider a p-cycle c and a closed p-form w. The Stokes theorem (A.128) guarantees 
that the integral of w over c does not change if we shift w by an exact p-form, or c 
by a p-boundary. In other words, for any (p — 1)-form o and any (p + 1)-chain d, 


I (o+do)= fot faos f o+ | do 
c+tad c c dd dd 
= for for faw f aaoy= fo. 
c dc d d c 


where I for simplicity dropped the subscript on the boundary operator. What this 
says is that the integral f ¿® only depends on the equivalence class of c as an 
element of H,(M, R), and on the equivalence class of w as an element of H? (M). 


(A.144) 
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The pairing (c, w), defined by (A.141), therefore represents a bilinear function on 
H,(M, R) x HP (M). The de Rham theorem asserts that when the manifold M is 
compact, this bilinear map is nondegenerate. This means that (c, œw) vanishes for all 
closed p-forms w if and only if c is a p-boundary, and for all p-cycles c if and only if 
is exact. It follows at once that the spaces Hp (M, R) and H? (M) are isomorphic. 
The bilinear map (c, w) establishes a duality between (singular) homology and (de 
Rham) cohomology. 


The statement of duality is highly nontrivial. It only applies to the quotient 
spaces Hp(M, R) and H?” (M) and not separately to the spaces of p-cycles 
and p-cocycles (closed p-forms), or p-boundaries and p-coboundaries (exact 
p-forms). Luckily, out of all the different classes of p-chains, it is the 
homology group H,(M, R) that is of greatest interest. The de Rham theorem 
then allows us to compute the homology groups of the manifold by means of 
the exterior calculus of differential forms. 


Example A.24 


There are no (—1)-forms on any manifold, and hence no exact 0-forms. Thus, 
H°(M) consists of all closed 0-forms. The condition df = 0 for f € Q°M that 
should be closed is satisfied if and only if f is a piecewise-constant function, that is a 
constant function on every connected component of M. As a consequence, H°(M) 
counts the connected components of M. This is in accord with our previous result 
for Ho(M, R) and the de Rham theorem. 

For an illustration of the cohomology groups of higher degree, let us consider the 
2-sphere S*. We know from Example A.23 and the de Rham theorem that H(S*) ~ 
H>(S*,R) ~ R, which guarantees the existence of one linearly independent 2-form 
that has a nonzero integral over S*. This can be naturally chosen as the volume 
form (A.22). On the other hand, the first de Rham cohomology group of S? is trivial 
since Hı ($°, R) is. We conclude that on the 2-sphere, the Poincaré lemma holds 
globally for 1-forms; any closed 1-form is necessarily exact. 

Somewhat more interesting than the 2-sphere is the 2-torus 7”. Here we deduce 
from Example A.23 that H'\(T*?) ~ Hi(T2,R) = RÈ. It is easy to guess what 
the two independent nontrivial closed 1-forms might be. Just think of T? as the 
Cartesian product S! x S!. This gives natural angular coordinates on T?, 012 € 
(0, 277). The forms do! d0? are closed but not exact since the angular coordinates 
are not globally well-defined. A different way to view this is to think of the basis of 
H; (T?, R), Ca, as two independent circles that wind around the torus. Then œf = 
d6“ /(27r) is the corresponding dual basis of H'(T7) in the sense that (cg, œ?) = 8b. 
As to H?(T?) ~ H(T?, R) ~ R, it is likewise easy to guess that its single generator 
can be taken as d0! ^ d@2, that is the volume form on the torus. 
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It is straightforward to generalize the above observation to the n-torus with any 
positive n. All we need to do is think of T” as (S!')*” and introduce a set of n angular 
variables 0°, a = 1,...,n. We then expect that the p-th de Rham cohomology 
group of T” is generated by the set of p-forms of the type dO A - - - A d6%”, where 


a\,...,@p is a set of different indices. This indicates that H?(T") ~ RO), which 
allows us to understand the corresponding result (A.139) for homology. 


Example A.25 


Example A.21 shows that closed forms can be used to construct topological 
invariants. One considers a smooth map f : M —> N defined on the n-dimensional 
manifold M and taking values from the target manifold N. If M has no boundary 
and w is a closed n-form on N, then the integral f m f*@ does not change under 
smooth deformations of f. But if œ is exact, then so is f*@ and its integral over M 
necessarily vanishes by the Stokes theorem. Nontrivial topological invariants can 
therefore only arise from nontrivial cohomology equivalence classes on N. 

A simple example of a topological invariant is provided by the volume 
form (A.22) on $°. Let us consider a function defined on R? and taking values 
on S*. Following the notation of Example A.7, I will denote the function using a 
unit vector, n : R? — S°. In physics, n may represent for instance a configuration 
of a two-dimensional (anti)ferromagnet. Suppose now that our function tends to a 
constant at infinity. We can then treat the infinity as a single point, which effectively 
turns R? into a compact manifold without boundary. (It is common to say that we 
have topologically compactified R? to $2.) Using the expression for the volume 
form on SÊ in terms of n and pulling it back to R?, we get the topological invariant 


1 

w[n] = =Í n - (Oxn x dyn) dx dy , (A.145) 
An R2 < 

where x, y are the Cartesian coordinates in R?. The normalization factor 1/(47) is 

conventional and ensures that w[m] only takes integer values. This is the Brouwer 

degree of n, treated as a map S? — S?. 


Let me conclude the appendix with a lightning summary of some further 
interesting properties of de Rham cohomology. A curious reader will find more 
details in Sect. 6.4 of [2]. For an n-dimensional compact manifold M, let us define 
a pairing of a p-form w and an (n — p)-form o as the bilinear map 


(w,o) >f ONO. (A.146) 
M 
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It is easy to see that if M has no boundary and both w, o are closed, then the result 
of the integration depends only on the respective equivalence classes of w, o as 
elements of HP (M) and H”~?(M). Moreover, (A.146) as a bilinear map H”? (M) x 
H"~?(M) — R turns out to be nondegenerate. It follows that the spaces HP (M) 
and H”~?(M) are isomorphic. This fact is known as the Poincaré duality. 


Example A.26 


We already saw the isomorphism H?(M) ~ H”~?(M) at work in Example A.24. 
The Poincaré duality however also allows us to draw some generally valid conclu- 
sions. For instance, for any compact connected manifold M without boundary, it 
tells us that H"(M) ~ H°(M) ~ R. Let us unpack the meaning of this statement 
in the special case that the manifold M is Riemannian. Then any n-form w is 
represented as œ = x f in terms of some function f on M. The representative 
of the single nontrivial equivalence class of n-cycles can be taken as M itself. By 
the de Rham theorem, œ% is therefore exact if and only if f m Í = 9. The exactness 
implies that w = x f = do for some (n — 1)-form o. This leads to 


f =xdo = (-1)"8«% o = (-1)""! div(xo)’ , (A.147) 


where I used (A.114) plus the definitions of codifferential and divergence. In plain 
terms, the function f is a divergence of some vector field if and only if the integral 
of f over M vanishes. This generalizes a statement frequently used in physics. 


Sometimes, the manifold in question has the structure of a Cartesian product, 
M x N. Any differential form w on M can be lifted to M x N. Formally, this is 
done using the projection zm : M x N —> M which maps a point (x,y) E Mx N 
tox € M. Then, for any w € QIM, rž% is a q-form on M x N. Analogously, 
forms on N are lifted to M x N using the projection my : M x N > N. Now for 
any closed q-form w on M and any closed r-form o on N, 14, A myo is a closed 
(q+r)-form on Mx N. Moreover, the cohomology equivalence class of 1, w^n Ko 
depends only on those of w and ø. We can thus obtain some information about de 
Rham cohomology groups of M x N if we know those of M and N. What is less 
trivial is that all cohomology generators on M x N can in fact be recovered in this 
way. This is expressed by the Kiinneth formula 


H? Mx N= Q [HUM eH), (A.148) 
qtr=p 


where the & symbol denotes tensor product of vector spaces. 


428 A Elements of Differential Geometry 


The de Rham cohomology groups of the circle, S!, are H°(S') ~ H'(S!) ~ R. 
The former follows from the fact that S! is connected, and the latter most easily 
from Poincaré duality. Using the Künneth formula (A.148), we can now recursively 
construct all de Rham cohomology groups of the n-torus. Let us see how this works 
for the 2-torus, T? ~ S! x S!. The zeroth cohomology group is of course R since 
the torus is connected. The first cohomology group is 


H! (S! x SH) ~[A% Ss!) @ A'S!) a [H! (Sh) @ AS!) ~RORZR. 
(A.149) 


The second cohomology group is similarly 
H’(S'x S!) ~ H'(S')@H'(S')~R@RER. (A.150) 


With the same reasoning, one then proves by induction that the de Rham coho- 
mology groups of the n-torus are H? (T”) ~ H?((S!)*") ~ RY’, as guessed in 
Example A.24. 


There is a simple relation between de Rham cohomology groups and homotopy 
groups that can be utilized to deduce the former from the latter. The precise 
statement is provided by the Hurewicz theorem (see Sect. 22.3 of [4]). Let M be a 
compact manifold and let 7,(M), p > 1 be the first nonvanishing homotopy group. 
Then HP (M) is the first nonvanishing de Rham cohomology group (with nonzero 
degree) and H?(M) ~ z,(M), if the latter is treated as a real vector space.’ With 
the additional assumption that 71 (M) is Abelian, the same statement holds also for 
p = 1. Let me illustrate the use of this theorem on an example of great relevance 
for the classification of effective actions in Chap. 8. 


With a bit of homotopy theory, we can find the second de Rham cohomology 
group of the homogeneous space G/H, where G and H are Lie groups. What we 
need from homotopy theory is the exact sequence of homotopy groups (see Sect. 9 
of [12]), 


+++ —> (G) > m1 (G/H) > ngk- (H) > m-1(G) >: . (A.151) 


7 This is a slight abuse of terminology. The Hurewicz theorem as presented in [4] really connects 
1 p(M) as an additive Abelian group to the homology group H,(M, Z) with integer coefficients. 
When translating Hp(M, Z) to Hp(M, R), which is by the de Rham theorem in turn isomorphic 
to H?(M), any finite part of H,(M, Z) is lost. 
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This implies that if both 2% (G) and z,_1(G) are trivial, then mg (G/H) ~ m_1 (1). 
Let us assume that G is compact and simply connected and H is connected. Then 
m\(G) ~ m0(G) > {e}, and hence 71(G/H) ~ m0(H) ~ {e}. The homogeneous 
space is itself simply connected. 

Next, we use the fact that 22(G) is trivial for any compact connected Lie 
group (Appendix B of [13]). Hence m2(G/H) ~ xı(H). But we have already 
established that G/H is simply connected. The Hurewicz theorem then tells us that 
H'(G/H) ~ 0 and H?(G/H) ~ 12(G/H) ~ 1(A). To summarize, let G be 
a compact and simply connected Lie group, and let H be its connected subgroup. 
Then the second de Rham cohomology group of the homogeneous space G/H is 
isomorphic to xı (H) as a real vector space. Since H is by assumption compact, this 
means that the generators of the second de Rham cohomology group of G/H are in 
a one-to-one correspondence with U(1) factors of H [14]. 
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